The New Matrix for Design


by Tom Williams, Editor-in-Chief


The general dictionary definition of a matrix calls it “a situation or surrounding substance within which something else originates, develops, or is contained.” No, it is not a computer-generated virtual reality fed into the brains of comatose humans producing power for space aliens. Despite that, the definition ideally fits the characteristics of today's programmable logic devices. They are a “surrounding substance” of gates from which developers can now evoke the functionality of entire systems. As a result of advances in speed, density and limited degrees of specialization, FPGAs today represent a landscape that is as yet not fully explored.

Within this issue, RTC is proud to bring you a special supplementary section on this “New Matrix for Design,” a technology that is assuming roles that previously were filled by coprocessors, CPUs, communications processors, DSPs, ASICs and, intriguingly enough, software. From their humble beginnings as “glue logic,” programmable logic devices have grown to include both soft and hard processor cores and the ability to incorporate both hardware and software functionality in the same device. Not only that, they have the ability to be reconfigured and updated through downloading new code while still installed in the system. Some today even have the capability of being partially reconfigured in a running system.

In a world that lives and dies by the whims of the twin demons, Cost and Time-to-Market, and their evil sister, Performance, FPGAs are helping developers move their ideas more quickly into functioning reality. On one level, they simply are used as a central part of a final design carrying IP cores, custom logic and software functions defined by the developer. On another level, they are used as prototypes to prove a design before it is committed to silicon, and on yet another they serve as the step to implementation of a structured ASIC, which can enhance the bottom line with smaller size, power consumption and, above all, cost as volumes ramp up. ASICs have increasingly been moved to those rarefied realms where only the highest volume production can justify the expense and risk of the increasingly complex development cycle.

FPGAs, when used as what might be termed “silicon software engines,” can accomplish what no von Neumann architecture can achieve, and that is truly parallel execution. Without multiple processors or cores, normal CPUs have to “fake it” with interrupts, context switches and deadline scheduling. FPGAs make it look relatively easy.
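
The difference between "faking" parallelism and having it can be made concrete with a toy timing model. The sketch below is illustrative only: the task lengths, the time slice and the context-switch cost are invented numbers, the function names are hypothetical, and the model ignores the fact that FPGA fabric and CPUs usually run at different clock rates.

```python
def time_sliced_finish(task_cycles, slice_cycles, switch_cost):
    """Cycles for ONE processor to finish every task by round-robin slicing."""
    remaining = list(task_cycles)
    elapsed = 0
    while any(r > 0 for r in remaining):
        for i, r in enumerate(remaining):
            if r <= 0:
                continue  # this task has already completed
            run = min(r, slice_cycles)
            remaining[i] -= run
            elapsed += run + switch_cost  # each slice also pays a context switch
    return elapsed


def parallel_finish(task_cycles):
    """Cycles when every task has its own dedicated block of logic."""
    return max(task_cycles)


tasks = [400, 250, 900]  # assumed workloads, in clock cycles
print(time_sliced_finish(tasks, slice_cycles=100, switch_cost=10))  # 1710
print(parallel_finish(tasks))                                       # 900
```

In this model the sliced processor pays for the sum of all the work plus switching overhead, while the parallel fabric pays only for the longest task.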

Another interesting development has to do with hardware inventory. A manufacturer can now design a single board-level hardware platform consisting of perhaps a CPU and a set of FPGAs, and by incorporating different IP into the FPGAs, essentially offer completely different products to the customer that actually have the exact same hardware in common.

Of course, it's never all roses. We are still getting our bearings on how to fully utilize the potential of the FPGA and much is still to be learned through experience. For one thing, the development tools are still, well, developing. We have come a long way since the days when an engineer entered a net list into a computer, and even since FPGAs were primarily programmed using a hardware description language or RTL. Now we are in an era when developers can define functionality using graphical tools or even familiar software tools, such as in a “C-to-gates” scenario.

Debates rage as to which approaches are best and that is a good thing. Yet the FPGA has made steady progress from lowly “glue” to important IC replacement to coprocessor and on to main processing element, and is now in some cases the “whole shebang” in many applications. That is not to say that CPUs and DSPs are being replaced, far from it. In a vast number of cases the technologies complement one another. Still, it is exciting to see this versatile and powerful technology coming into its own, and it will be even more intriguing to watch as engineers and developers discover ever more innovative ways that it can be put to work solving the problems of high-end systems.

Software Compilation to FPGA Coprocessor Enables
Supercomputing Adoption

Celoxica has announced software programming support for the SGI RASC RC100 blade from Silicon Graphics. Based on SGI's Reconfigurable Application Specific Computing (RASC) technology, the RC100 computation blade packs the power of dozens of supercomputer nodes into a single blade by leveraging the parallelism of dual Xilinx Virtex 4 FPGAs. Using Celoxica's DK Design Suite and libraries, the RC100 blades can be programmed directly by the end user to accelerate custom C software algorithms, overcoming the traditional barrier to FPGA-based computing.

The Celoxica environment speeds and simplifies the programming of FPGA devices from C-language software, enabling the acceleration of many high-performance computing (HPC) applications by orders of magnitude over conventional systems. Celoxica's library for the RASC RC100 provides C-language calls to access all the SGI core functions, registers, memories and debug resources. The DK Design Suite compiles these software functions and user algorithms directly to custom hardware to take full advantage of the parallelism of the FPGA devices and the performance of the SGI architecture. Celoxica and SGI have also signed a resale agreement making the Celoxica programming environment available through the SGI sales force.

Cavalli, general manager of the HyperTransport Technology Consortium. “HyperTransport has proven to be the industry's most flexible, powerful and future-ready standard interconnect solution for compute-intensive system designs, delivering a winning combination of high performance, standardization and optimized total cost of ownership (TCO) for data center and supercomputing applications.”

HyperTransport 3.0 extends the 1.4 GHz dual data rate (DDR) maximum clock of HyperTransport 2.0 to 1.8 GHz, 2.0 GHz, 2.4 GHz and 2.6 GHz, and delivers a maximum aggregate bandwidth of 41.6 GB/s, a bandwidth increase of

Solarflare Communications and Level 5 Networks, two leading early-stage Ethernet

raise new funding increasing the cash balance of the new company to over $50 M. The merger was [...] shared by the two companies to deliver high-performance, cost-effective, standards-based Ethernet products that enable one network for compute, storage and network traffic. The merged company will be called Solarflare Communications and

In the QNX Neutrino RTOS, device drivers, file systems, and protocol stacks all run outside of the kernel, as memory-protected processes. This architecture virtually eliminates memory corruptions, mysterious lockups, and system resets. Achieve maximum reliability and put an end to endless debug sessions.

AMC Offers System Design
Flexibility


Developed as an enhancement to ATCA, the AMC standard has gone beyond its original function of lowering system costs. Its combination of performance and size allows it to be used, with the right framework, to develop powerful yet compact systems.


by Todd Wynia 
Artesyn Communication Products 


A few years ago, at the beginning of the new millennium, it was becoming clear that the popular CompactPCI framework could not address all telecommunications applications. As telecommunications moved from circuit-switched to packet-switched technologies and as both networking speed and demand grew, CompactPCI was beginning to run out of room for some applications. Cards were not large enough to contain the number of channels required in network server blades used in the central office, nor did they offer enough power density to handle the needed circuits.

In conjunction with telco operators, PICMG defined the new AdvancedTCA (ATCA) platform for next-generation telecommunications system designs. The specification, adopted in 2003, offers many features that address the needs of modern telecommunications systems. It uses a high-performance, switched fabric serial backplane, operating at speeds of up to 10 Gigabits/s per link. The fabric provides a natural interface between the equipment and today's packet-switched networks.

AdvancedTCA also incorporates the elements necessary to ensure high-reliability system designs. This includes built-in redundancy in essential system structures such as backplane channels and system management. It also includes a native ability to allow hot-swapping of circuit cards so that the system can be maintained without shutting it down. In addi

AMC modules can be half- or full-height, single- or double-wide. These sizes allow front panel insertion into ATCA carrier cards.

[...] will the board power up the rest of its circuitry and come online. If the board should fail in some way, the system manager can command the board to go offline and power down, preventing it from disturbing the system while it is being removed and replaced.

AMC Improves on ATCA

The original ATCA definition has all the key features needed for next-generation telecommunications systems, but improvements are always possible. One idea that quickly arose was a means for reducing the cost of boards in the system. PICMG members soon realized that defining an appropriate mezzanine module would allow the development of standardized [...] for key system functions, allowing multiple vendors to offer interchangeable [...] and thus lowering their cost.

[...] in the development, inventory and maintenance of systems. A carrier card and four different modules allow the creation of 24 card variations with only five items to stock. This reduces inventory costs [...] replacement at the module level instead of the card level. This reduces the cost of component failure.

PICMG released the Advanced Mezzanine Card (AMC) specification in 2005 to provide this finer granularity. These modules incorporate the same IPMI interface and control features as the full ATCA cards. They also feature several attributes that contribute to substantial performance.

One of the performance-enhancing features of the AMC definition is the [...] size and power budget. The cards come in four different configurations: half-height and full-height in single- and double-widths (Figure 1). The different heights allow the development of smaller modules containing only [...], or larger [...] that also have room for front panel I/O [...] fans and disk drives. The range of [...] room for implementing [...] and enough power to run them.

Advanced Mezzanine Card modules are designed to allow front-panel insertion into the carrier card, simplifying their replacement in an active system. This also [...]

The AMC module specification thus results in considerable versatility. Modules that carry fully loaded general-purpose processors, DSPs or baseband network processors, including a full complement of memory and peripherals, are achievable within the power budget and size envelope of AMC modules. Other achievable functions include FPGA-based hardware accelerators, hard disk drives, clock generators and drivers, and intelligent I/O that runs its own protocol stack. Simple functions, such as ordinary parallel I/O, and high-speed functions, such as an E1/T1 interface, can be put on a module and given plenty of front panel connections.

Building multiple systems with AMC modules.

Even radio modules and RF amplifiers can be readily implemented.

This design versatility, coupled with the flexible rear-connector serial interface and the IPMI interface, allows an AMC module to function almost as a miniature ATCA card in its own right. This capability did not escape the notice of PICMG and the industry, and efforts were soon underway to define a structure that would allow use of AMC modules in a system without a carrier card. Ratification of the resulting definition, MicroTCA, is expected during 2006.

MicroTCA is essentially a backplane and chassis that serves as a virtual ATCA carrier card for AMC modules (Figure 2). The backplane supports star, dual-star and full mesh configurations. This structure allows the creation of designs that are lower in cost and more compact than ATCA systems while retaining significant performance. Together, ATCA and MicroTCA offer a wide range of options for trading off among cost, physical size, performance and capacity while employing the same AMC modules.

In order to act as a virtual ATCA carrier, the MicroTCA system must provide modules with the same functions offered by a carrier. These include the switched fabric for the rear connector interface, system clock and clock distribution, and shelf management using IPMI. The MicroTCA carrier hub provides these services to the modules.

The MicroTCA form-factor utilizes the versatility of the AMC module, dispensing with carrier cards and plugging the modules directly into a backplane.


System Design with AMC

MicroTCA is only a beginning. As long as AMC modules receive the necessary clock and management services they can be used in any physical configuration that can contain them. Form-factors such as a MicroTCA cube are already under development, for instance, and even smaller “nano-systems” are possible. Further, the serial backplane is readily extended through cabling, allowing the concatenation of cubes into chains or even wrapping a system around a pole in a non-standard form-factor, without requiring alteration of either the modules or the logical structure.

Choosing among the ATCA, MicroTCA, nano-system or a custom form-factor is mostly a matter of trading off between size and the benefits that come with greater physical capacity. Using AMC cards is relatively straightforward. A system manager is required to run the communications protocols and switching functions of the backplane, run the IPMI interface and handle any high-availability software the system requires. Since AMC modules are not system-aware, external control is needed to handle system errors, failover and hot-swap functions. Beyond that, system development is more a matter of collecting the right set of modules.


Only a few AMC modules in combination are required to provide the basis of many systems (Table 2). A media server, for instance, might need only a CPU module for configuration and control, a DSP module for decoding the media streams and a hard drive module for holding the media. A signaling gateway might contain a CPU along with an intelligent I/O module. Add a DSP for encoding and decoding, and the design becomes a media gateway.
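
The mix-and-match idea in the paragraph above can be expressed as a small lookup. This is a sketch only: the module labels and the `buildable_systems` helper are hypothetical names chosen for illustration, and the recipes contain nothing beyond the three examples given in the text.

```python
# Module sets taken from the examples in the text; labels are illustrative.
SYSTEM_RECIPES = {
    "media server": {"CPU", "DSP", "hard drive"},
    "signaling gateway": {"CPU", "intelligent I/O"},
    "media gateway": {"CPU", "intelligent I/O", "DSP"},
}


def buildable_systems(stocked_modules):
    """Return the system types whose required modules are all in stock."""
    stocked = set(stocked_modules)
    return sorted(
        name for name, needed in SYSTEM_RECIPES.items() if needed <= stocked
    )


print(buildable_systems(["CPU", "intelligent I/O"]))
# ['signaling gateway']
print(buildable_systems(["CPU", "intelligent I/O", "DSP"]))
# ['media gateway', 'signaling gateway']
print(buildable_systems(["CPU", "DSP", "hard drive", "intelligent I/O"]))
# ['media gateway', 'media server', 'signaling gateway']
```

Stocking four module types is enough to cover all three example systems, which is the inventory argument in miniature.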

Although born out of a desire to reduce costs in ATCA systems, the AMC standard is proving to have tremendous versatility. The modules can support high-performance functions, have the essential elements of hot-swap capability and can operate as stand-alone cards, as well as a component of a larger card. With the right framework, AMC modules can form the basis of systems covering a full range of telecommunications applications.





Artesyn Communication Products
Madison, WI.

(603) $31.50, 

[www.artesyncp.com]





RTC May 2006


Converging Technologies
Deliver Next-Generation 
Embedded Solutions 


Originally conceived as a companion technology for ATCA, the 
AdvancedMC form-factor is proving to be a versatile building block 
for systems architectures that combine a variety of new and legacy 
technologies. 


by Robert Persons and William Coffey 
Motorola 


[...] traditionally relied on VME are driving to transform and modernize their systems to take advantage of new processor architectures and fabric-based system architectures as quickly as possible. This poses a challenge because system integrators and OEMs would like to make incremental changes to existing systems and not have to redesign all elements. They would like to reuse I/O that has traditionally been supported on VME, while updating the compute control core to new technologies and have them work together. Rather than selecting a single standard, the optimal solution will be provided through support by both VME and MicroTCA platforms for common, published, open software standards in terms of operating systems and middleware. It is this convergence of existing and new technologies that will be critical to the modernization of high-end systems.

MicroTCA leverages the emerging ecosystem of the Advanced Mezzanine Card (AdvancedMC) to create a new, flexible, small form-factor platform. It is important that system manufacturers and integrators understand the capabilities that MicroTCA brings to the embedded market.

AdvancedMCs were originally developed for AdvancedTCA (ATCA) platforms. ATCA has been targeted at the aggregation layer of the “telecommunications network onion,” in between the outer edge “access” devices and inner “core” switching functions. Here, emphasis is placed on a high compute capacity per blade, high availability, high


MicroTCA-based communications servers can leverage the same POSIX-compliant operating systems and management middleware from ATCA. The Netplane suite of software from Motorola provides Service Availability Forum (SAF)-compliant high-availability services independent of the underlying hardware or target application.


bandwidth fabric between the compute blades and a few high-speed, high-density network I/O interfaces. And, given the investment in ATCA chassis and blades, it is important to have an ecosystem where many different solutions can be built around the same fabric (and thus the same or compatible blades).

AdvancedMCs bring further economies of scale to ATCA in the form of a more granular approach to the addition of either specialized processing elements or I/O interfaces. Adding from one to eight AdvancedMCs to an existing ATCA processing, I/O or dedicated carrier blade incrementally extends the solutions possible from a single ATCA chassis, both horizontally toward the network edge and vertically within the aggregation layer. This leverages the investment in ATCA [...] into wide telco applications.

However, there are still environments where ATCA is simply either too large or too expensive to serve as a base


architecture, or where -48 VDC is not an option. Thus, the idea of MicroTCA as an architecture was born (around the same time AdvancedMCs were first conceived), where the same pay-as-you-go scalability could be realized in a much smaller, self-contained carrier assembly, and where to a certain extent the chassis form-factors (and applications) are limitless. Here, a typically passive backplane carrier can be mated with the appropriate type and number of fabric “hub” modules to provide the bandwidth, protocol and redundancy required for the application.

Six AdvancedMC form-factors of varying component height and module width have been specified, all leveraging the same high-speed, 170-pin edge connector. The MicroTCA specification allows for modular or monolithic chassis configurations from one carrier and one module to 16 carriers and 192 modules, while ensuring that modules always see the same “virtual” environment. These MicroTCA communications servers typically support two to three independent fabric interconnects on a carrier, where each fabric “port” (differential transmit/receive pair) is capable of up to 6.25 Gbits/s in each direction, and specific ports can be aggregated to form “fat pipes” with higher throughput.
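
The scaling figures in the paragraph above reduce to simple arithmetic. The sketch below assumes that aggregated ports add linearly and treats the quoted 6.25 Gbits/s as a raw per-direction rate, ignoring any line-coding or protocol overhead; the names `fat_pipe_gbps` and `modules_per_carrier` are illustrative, not taken from the specification.

```python
PORT_GBPS = 6.25    # per-port rate in each direction, as quoted in the text
MAX_CARRIERS = 16   # largest configuration quoted in the text
MAX_MODULES = 192


def fat_pipe_gbps(ports, port_gbps=PORT_GBPS):
    """Raw one-direction throughput of a 'fat pipe' built from several ports."""
    if ports < 1:
        raise ValueError("a fat pipe needs at least one port")
    return ports * port_gbps


# 192 modules spread over 16 carriers implies 12 modules per carrier.
modules_per_carrier = MAX_MODULES // MAX_CARRIERS

print(fat_pipe_gbps(1))        # 6.25
print(fat_pipe_gbps(4))        # 25.0
print(modules_per_carrier)     # 12
```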

PICMG has defined AdvancedMC fabric interconnect standards based on Gigabit Ethernet/10 Gigabit Ethernet (GigE/10GigE; AMC.2), PCI Express (AMC.1) and Serial RapidIO (AMC.4), along with defining storage interconnections based on SATA/SAS or Fibre Channel (AMC.3). The MicroTCA specification leverages these standards, allowing for switched and/or point-to-point fabrics. Current MicroTCA fabrics typically range from 1 Gbit/s (one port) to 12.5 Gbits/s ([...] ports). The current aggregate carrier (switched backplane) bandwidth is around 40 Gbits/s, but next-generation hubs should exceed this, and the MicroTCA specification allows for up to 12.5 Gbits/s per port.

At present, AdvancedMC modules are available or under development for general-purpose processing (x86 & PPC), Digital Signal Processing (DSP), Digital Signaling (E1/T1/J1, OC-x, DS3, etc.), Serial ATA (SATA) and Serial Attached SCSI (SAS) storage, GigE/10GigE, Wireless Broadband (WiMAX), Voice-over-IP (VoIP), VGA video, and even PCI/PCI Telecom Mezzanine Card (PMC/PTMC). While the majority are focused specifically on telecommunications, more than a few provide applications in other solution spaces, such as military, aerospace, industrial and medical.
AdvancedMCs were designed to integrate into the highly available environment of ATCA, and this support carries over to the MicroTCA environment. Offering hot swap, Intelligent Platform Management Interface (IPMI), dynamic fabric negotiation, power budgeting and more, MicroTCA covers the availability, serviceability and manageability requirements of many target markets. In addition, unlike the ATCA blade fabric, AdvancedMC fabrics can differ between MicroTCA (and ATCA) carriers (within the same or different shelves), allowing the most appropriate choice for the application. As such, the lower overall cost of these modules will allow a less fabric-sensitive ecosystem to develop.


VME/MicroTCA Convergence

At first blush, MicroTCA appears to compete with VME, especially with next-generation VME fabric solutions, such as VXS (VITA 41) and VPX (VITA 46). While there is clearly a choice that must be made when considering general-purpose computing platforms, VME continues to be the logical choice in many military, industrial control and medical imaging applications. Because VME has [...] ecosystem of COTS and custom [...] military applications, it will continue to be a critical architecture for [...]

The purpose-built backward compatibility of each successive revision to the VMEbus standard allows many VME edge systems to be refreshed with single board computers that support POSIX operating systems, open middleware, 2eSST and multiple GigE interfaces. These SBCs can integrate into [...] environments with ease, yet still continue to interface with legacy I/O devices. In many cases, they can also communicate internally at fabric speeds [...]

However, some of the traditional roles VME has played, such as concentrated digital signal processing systems
[...] technologies such as network-centric MicroTCA communications servers. A heterogeneous blend of VME-based I/O subsystems supporting traditional VMEbus I/O will exist, primarily at the edge of the system, interfacing to weapons or sensors. Concurrently, compute-centric functions, such as digital signal processing systems, can take advantage of the compute density of MicroTCA.


How Will it Work?

Software standardization efforts will allow the development of heterogeneous environments where a common software architecture is used throughout the system. VME and MicroTCA-based systems will run common operating systems and middleware optimized for that architecture. Applications running on VME processor blades will share data through the middleware, using plug-in modules that optimize the transport for 2eSST, while MicroTCA systems will have transport plug-ins optimized for GigE or PCI Express.

Data sharing between systems will continue to use GigE to share data between applications running on different systems. Abstracting to a middleware product, like DDS, eliminates the need for application software to be written for one particular system architecture. Contractors can evaluate new technologies, such as MicroTCA, as candidates for upgrading hardware, while architects designing new systems can pick and choose the most appropriate architecture for each part of the system (Figure 1).

New technologies such as WiMAX (IEEE 802.16) can also be deployed using MicroTCA communications servers. WiMAX is a wireless broadband technology that is being considered for systems that will help control the cost of wireless high-speed transport. Platform variants defined in the MicroTCA standard, like cubes and Pico shelves, are well suited to support network nodes based on WiMAX. At the same time, rugged variants of MicroTCA communications servers will move MicroTCA into the field and into vehicles.

In order to take this one step further and to determine how AdvancedMC blades and MicroTCA chassis components can be ruggedized for a variety of environments, a Special Interest Group (SIG) has been formed by a collection of companies interested in MicroTCA. The goal of the SIG is to allow the use of COTS AdvancedMC modules in both commercial and military/industrial applications. In addition, the “ruggedization” SIG is researching how new packaging techniques can be applied to commercial AdvancedMCs to harden them sufficiently for harsh environments and whether conduction-cooled systems can be designed around them. If COTS AdvancedMCs can be used, the overall system cost will be lower and the variety of blades available to an application will dramatically increase. The work of the SIG will be part of an official standards body working group, so the completion of a ruggedization specification should coincide with the release of the MicroTCA standard.

VME64x still offers a great deal of variety, but at reduced backplane bandwidth. VITA 41 will increase the backplane bandwidth with the addition of fabric interfaces, but this will require a system retrofit. MicroTCA has a much smaller footprint and the ability to scale better than a VXS-based system. In addition, the AdvancedMC ecosystem has already begun to grow prior to the release of the MicroTCA system standard.

The use of AdvancedMCs in a variety of applications both on ATCA carrier blades and within MicroTCA communications servers will help to substantially drive down their cost. Moving forward, future systems will be able to leverage AdvancedMC flexibility without the added expense and infrastructure requirements of ATCA. Furthermore, applications that can benefit from the increased power envelope of ATCA can also benefit from the variety of AdvancedMCs that will be available for both environments.

Motorola
Tempe, AZ.
(800) 759-1107.
[www.motorola.com/computing]
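
The middleware arrangement described under “How Will it Work?”, in which application code stays the same while a transport plug-in is chosen per platform, can be sketched as follows. This is a minimal illustration rather than the API of DDS or of any Motorola product: every class and function name here is hypothetical, and the transports only record messages instead of driving 2eSST or Ethernet hardware.

```python
from abc import ABC, abstractmethod


class Transport(ABC):
    """Plug-in interface the middleware uses to move data on one platform."""

    name = "abstract"

    @abstractmethod
    def send(self, topic, payload):
        """Deliver one message; real plug-ins would drive the bus or fabric."""


class RecordingTransport(Transport):
    """Stand-in plug-in that records messages instead of touching hardware."""

    def __init__(self, name):
        self.name = name
        self.sent = []

    def send(self, topic, payload):
        self.sent.append((topic, payload))


class Middleware:
    """Common layer: applications publish here and never see the transport."""

    def __init__(self, transport):
        self._transport = transport

    def publish(self, topic, payload):
        self._transport.send(topic, payload)
        return self._transport.name  # returned only so the demo can show it


def publish_track(middleware):
    """Application code, identical on every platform."""
    return middleware.publish("sensor/track", b"\x01\x02")


vme_node = Middleware(RecordingTransport("2eSST"))    # VME processor blade
utca_node = Middleware(RecordingTransport("GigE"))    # MicroTCA system

print(publish_track(vme_node))   # 2eSST
print(publish_track(utca_node))  # GigE
```

Only the line that constructs each `Middleware` differs between the two platforms, which is the portability argument the article makes for abstracting to a middleware layer.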

The CPC5564 64-Bit AMD Opteron™ Single Board Computer

The CPC5564 is the world's most powerful PICMG® 2.16 compute blade, and the first to be based on single- and dual-core AMD Opteron™ processors. The AMD Opteron™ processor provides a highly scalable x86 architecture that delivers next-generation performance as well as a flexible upgrade path from 32- to 64-bit computing. Its multi-core architecture offers advanced processing speed while reducing heat and power consumption.

With up to 8GB of ECC memory, multiple storage options and Linux, Solaris™ and Windows® operating system support, the CPC5564 is an ideal computer for high-end packet processing or multi-threaded software environments found in wireless, softswitch and defense applications.

Is it the superhero of the compute world? We like to think so.
Non-Traditional Blade
Server Applications 


As blade servers pack increasingly more punch into smaller form-factors, 
their use is spreading beyond data center applications. The integration 
of newer technologies makes blades attractive for high-performance 
computing purposes such as distributed computing, rendering/imaging 
and data analysis. 


by Laura P. Cooper 
Nextcom 


Blade servers have gained ground over the past few years as an efficient, condensed computing solution in large data center implementations. They are widely used as replacements for pizza box servers and large rackmounts (Figure 1).

The advantages inherent in blade server technology include smaller form-factors, denser computing, expandability, hot-swap capabilities, flexible and fail-safe architecture, reduced downtime, increased redundancy, simplified server management, easier hardware and software integration, and lower heat dissipation and power requirements. These improvements over traditional data center servers provide for a massive increase in deployable resource density and an overall reduction in long-term costs.

Each blade in a chassis is typically a self-contained server. Data center consolidation, advanced communications and remote management of servers required to run 24/7 are just a few of the reasons for large-scale deployments. These deployments are becoming more popular for telecom, telephone and cellular carriers, insurance companies, tax preparers, state and local government agencies and educational institutions, among others.

Although blade servers have a long way to go before they are the standard deployed technology throughout the data center, they are beginning to appear in less traditional implementations such as high-performance computing (HPC),

Data Center Blade Configuration
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Newer Blade Technology 
Enables High-Performance 
Functionality 

Many of the characteristics that enable the use of blades in HPC
applications have emerged in recently introduced blade servers. As they
become increasingly smaller and more powerful, blades are incorporating
technology such as open standard architectures, multicore processors, PCI
expansion for multiple I/O functionality, the ability to house multiple
OSs, low-power processors and innovative cooling techniques, standard AC
electrical connectivity, daisy-chaining and Gigabit Ethernet ports. When
these are combined


Figure 1: Blade servers are gaining ground as an efficient, condensed
computing solution in large data center implementations.


2AERIM May 2006 













































































with the growing trend of virtualization, blade functionality increases
even more.

The great advantage of an open architecture is that anyone can design
add-on products for it. An open architecture also increases the potential
for partnerships that enable the integration of design cycle management
tools. By integrating the tools engineers already use within the same user
interface, open design environments enable greater productivity.
Innovative products, open standards and interoperability are fundamental
to the mainstream adoption of new services in the HPC world.

The emergence of multicore processors gives blade manufacturers even
denser computing options. Chips such as Advanced Micro Devices' dual-core
Opteron processor offer significantly greater computing performance than
equivalent single-core devices, yet produce no additional heat and do not
require additional power. The consolidated processing strength of multiple
cores enables next-generation systems and opens up new possibilities for
very high-performance blade servers.

The reduction of bottlenecks among processors and system components in
AMD's Direct Connect architecture, for example, makes possible more
efficient use of current system resources as well as tomorrow's high-end
components (Figure 2). In particular, dual-core processing is well suited
to large clusters. The processing capability of two cores on one chip
makes possible server and workstation consolidation. The efficiency of
this technology also opens up a new level of high-performance computing
and the flexibility to design more innovative solutions that directly
address specific customer needs.

The quest for greater performance in embedded and high-performance
applications has pushed systems manufacturers to incorporate PCI slots
into blades. PCI expansion slots are being implemented in newer blade
servers. Higher-performing PCI-X clock speeds of up to 133 MHz push
throughput up into the gigabyte-per-second range. This speedy bus is ideal
for critical cards such as frame grabbers and
AMD64 Technology with Direct Connect Architecture
Figure 2: Multicore processors combined with more efficient architectures
can improve the use of system resources and high-end components.
(Courtesy of Advanced Micro Devices)


higher-end control cards. PCI-compatible graphics cards can now be added
directly to the server for use in backracking, machine vision and imaging
implementations.

PCI-X is viewed by many as a logical way to combine more robust bandwidth
and greater cost efficiency in data-intensive environments. Applications
range from medical imaging to industrial controls to virtual private
networking, as well as communications systems and storage area network
products such as clustered servers. A PCI-X solution usually offers 32-bit
or 64-bit modes and is backward compatible with existing PCI
configurations.

The ability to house multiple OSs, previously a convenience, is quickly
becoming a requirement in high-performance applications. Within a single
rack the newer blades can each run a different OS, either in single or
dual boot mode, or, more efficiently, two concurrent virtualized OSs.
Ideally, a blade would provide interoperability with all major OS
offerings, including Windows Server and XP Pro, RedHat, SUSE, Fedora Core
Linux, and in some cases, even Solaris 10. Off-the-shelf software
applications could be integrated seamlessly and most custom applications
could be installed with little or no reconfiguration.

Figure 3: Virtualization partitions a server into several virtual
machines, each able to run its own OS and application environment.
(Panels: Advantages of Virtualizing in Server Applications; Advantages of
Virtualizing in Testing and Development Environments.)

As blades become smaller and more efficient, low-power processors and
innovative cooling techniques are emerging that minimize power and cooling
requirements while increasing maximum processor core density. A blade can
further reduce wattage with a combination of the proper management of
power requirements and the customization of hardware configurations to
meet the needs of the application and task. A mix of active and passive
cooling techniques allows form factors to be reduced and blades to be
housed in much smaller rooms than required previously.

Single-phase AC power sourcing is a major development in blade server
technology. It simplifies electrical efficiencies and reduces data center
infrastructure requirements by eliminating the need for hard-wiring and
expensive transformer circuitry. Older server technologies required
specialized three-phase, 208V input connections. The expense of rewiring a
server room to accommodate this type of connectivity figures largely into
the overall total cost of ownership. Newer blades are often designed to
run on a standard 110/220V single-phase wall socket by utilizing advanced
load-balancing and integrated hot-swappable power distribution buses that
result in built-in redundancy and substantially lower cost.

Figure 4: Nextcom's reconfigurable NextServer 416 incorporates the latest
technology in a 4U footprint to provide a solution for blade server
consolidation, cluster and high-performance computing and reliable data
storage.

The ability to daisy-chain several blade servers frees up more space and
packs more computing horsepower into the same square footage by stacking
several chassis in a rack. Some blade
















manufacturers are now incorporating built-in keyboard/video/mouse (KVM)
switches and built-in inter-chassis daisy-chain ports, which eliminate
unnecessary cabling and the need for separate external monitors. This can
provide an aggregate interconnect rate of up to 10 Gbytes. Connectivity is
also becoming more efficient to storage area networks via Fibre Channel
and by the use of Gigabit Ethernet for creating virtual LANs.
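
As a rough sanity check on the PCI-X throughput claim made earlier in this
article, the sketch below computes theoretical peak bus bandwidth as bus
width times clock rate. The function name is illustrative, and the results
are nominal peaks that ignore protocol overhead, so sustained throughput on
a real system would be lower.

```python
def pci_peak_bandwidth_mb_s(bus_width_bits: int, clock_mhz: float) -> float:
    """Theoretical peak in MB/s, assuming one bus-wide transfer per clock."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * clock_mhz


# PCI-X offers 32-bit and 64-bit modes; 133 MHz is the clock cited in the text.
for width in (32, 64):
    peak = pci_peak_bandwidth_mb_s(width, 133)
    print(f"{width}-bit PCI-X at 133 MHz: {peak:.0f} MB/s peak")
```

Only the 64-bit mode crosses into the gigabyte-per-second range at this
clock rate, which is worth keeping in mind when sizing a bus for frame
grabbers or similar high-bandwidth cards.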











Virtualization and Blades: The 
Dense Computing Solution 

Virtualization, an old idea in the server community, has recently
re-emerged as a solution to the ever-growing problem of underutilized
physical servers. It partitions a server into several "virtual machines,"
each capable of running its own OS and application environment. When blade
server technology is combined with virtualization, the result is a massive
increase in functionality. For example, VMware's middleware solution
allows a host OS to simultaneously run a guest OS as a virtual layer, with
a multitude of OS configuration options. The optimal virtualization
implementation is applied to the highest performing processors. When
high-performance capabilities are applied to the smallest available blade
servers, virtualization is at its most efficient, enabling space
minimization, system efficiency and better application utilization
(Figure 3).











HPC Blade Implementations 

The decision to employ blade servers for non-traditional applications is
based on the same benefits that make them attractive for the data center.
Distributed computing, rendering/imaging, number-crunching processes, test
and measurement data analysis, content manipulation, server appliances or
gateways, and heterogeneous computing using mixed OSs are just a few of
the areas expected to utilize blades for HPC.

Computing appliance, or dedicated hardware, applications are demanding a
move from general-purpose computing to a model that is both powerful and
flexible, which combines the strengths of the multi-purpose model with the
appliance concept. Simple, reliable devices are required that facilitate
repeated tasks. Used as appliances, newer blades provide both the user
interface and the “box.” By utilizing open standard architecture and
off-the-shelf OSs developed for appliance applications, blades can be
fine-tuned to give the optimal performance of a specific desired service,
while non-required services are disabled.

As the technology of instrumentation used for signal capture and analysis
becomes more complex, so does the need for computers providing immense
processing capabilities and flexibility. High-quality measurement and
analysis of incoming data is crucial, and high-resolution displays are
required for visually rendering metrics of that data. Aerial and satellite
image analysis, automated mapping, detection of human activity, change
detection and perceptual organization are some of the processes that
require intensive computational processing.

Although signal analysis software that performs these functions often runs
on standard OSs such as Windows XP Pro, these complex calculations require
more intensive processing than a standard PC can provide. Blades can
process large amounts of data, as well as allow multiple monitor hookups.
The same requirements, including imaging/rendering needs, apply to test
and measurement data analysis.








The integration of blade technology [...] is poised to define a new [...]
of distributed computing. Backracking, or [...], is increasingly being
deployed as the architecture of choice for both small and large desktop
applications. Its benefits include the elimination of costs for moves,
adds and changes, which increases system uptime; the centralization of
support; and built-in security for both data protection and disaster
recovery. In a backracked environment a blade acts as a central point that
houses the OS and applications, distributing them as requested. In this
configuration, 100 or more computers can be stored in a single,
centralized rack. With blades in use as desktop PCs, computing power is
ensured while maintenance is consolidated and platforms and systems can be
quickly upgraded as new technologies become available.





Key vertical industries, including oil and gas, are beginning to utilize
blades for these non-traditional applications. The newest blade servers
are well suited to seismic data analysis, data manipulation, [...] via
FireWire interfaces. Military applications are beginning to utilize blades
as well, such as signal detection and analysis, surveillance, data
analysis and manipulation, and visual rendering of data. Other industries
include test and measurement and life sciences, both of which require
intensive, number-crunching processes, as well as analysis, manipulation
and rendering of data. A less obvious implementation is in the field of
media, where digital video capture, digital content creation, editing and
visualization are requiring increasingly greater processing capabilities
in a smaller space.





Many vendors are recognizing the need to address these less common
deployments of blade server technologies by adopting some, if not all, of
the requirements needed to perform these tasks. For example, Nextcom's
NextServer 416 (Figure 4) is a high-availability, extreme performance
platform that uses the latest technology to provide a solution for server
consolidation, cluster and HPC along with reliable data storage in a small
footprint, 4U reconfigurable server.

The platform's open standard architecture supports AMD's dual-core Opteron
processor and 64-bit Intel EM64T Xeon clustered or independent blade
computing. Its performance and form factor fit the needs of embedded,
distributed computing, high-performance imaging and interconnect
applications. The aggregate data rate is up to 10 Gbits/s and the platform
supports daisy-chained KVM, remote management and alarming, and blade hot
swappability.

Additional blade options include Fibre Channel, Gigabit Ethernet, hard
disk, flash disk, PCI-X I/O expansion and SCSI. External monitors can be
hooked up to each blade for remote monitoring and visualization. Multiple
networks for different functions and/or redundancy can run simultaneously.
On-blade storage and external network-attached storage provide flexibility
for partitioning applications and data. Blades can be packaged with 32-bit
and 64-bit workstation and enterprise server versions of Windows, Linux
and Solaris.

High-performance, open standard computing is becoming commonplace across
an increasing number of technology-rich industries. The recent
implementation of this technology in blade servers allows non-traditional
applications to maximize efficiency and performance. The use of blade
servers for non-traditional, high-performance applications will likely
increase, just as blade servers will continue to incorporate even more
functionality. Smaller form factors, earlier adoption of new developments
in multicore processor technology, increased flexibility and expandability
and increased power efficiency are likely to emerge as technology and
application needs evolve.
10 Gigabit Ethernet 





Backplanes Make ATCA 
Chassis COTS 


Although Ethernet's use in the backplane has been significant, its high 
latency has limited potential applications. Low-latency 10 Gigabit Ethernet 
switch chips and network interface chips are changing that picture. 


by Bud Noren 
Fulcrum Microsystems 





A key element in the future of 10 Gigabit Ethernet is the drive to lower
its latency and make it the de facto interconnect technology for COTS
systems. As engineers begin to rethink chassis design with the aim of
transitioning their systems to the AdvancedTCA form-factor, they are
increasingly considering 10 Gigabit Ethernet as the ultimate unifier of
the data center or central office.

ATCA Backplane Zones
The ATCA backplane is divided into three zones for data transport, power
and management.
Figure 1: In transitioning their systems to the ATCA form factor,
engineers are increasingly considering 10 Gigabit Ethernet for use in the
backplane. Zone 1 of the ATCA backplane is used for power distribution and
system management, zone 2 for data transfer and zone 3 for access to
additional I/O.

With its effort to develop a standardized chassis form-factor, ATCA
represents one of the most significant thrusts into COTS systems that are
currently underway. The promise of ATCA is that a single chassis will be
able to house boards from a variety of manufacturers and allow system
designers to buy the right functionality for their applications without
being held hostage to a vendor's closed system.

AdvancedTCA balances this openness with a chassis design that delivers the
performance needed for next-generation telecommunications systems. A key
part of that is ensuring that the system backplane walks the line between
high performance and off-the-shelf flexibility. In this environment,
Ethernet is a natural choice because it is already the dominant technology
used for the data flowing through the network: about 85% of all networks
are Ethernet-based. That broad adoption also means that the supporting
software and human knowledge are widely available to implement systems.

This widespread usage of the technology, in addition to its increasing
performance, has driven interest among systems builders in using Ethernet
in the backplane, despite a perception that end-to-end latency and
congestion management is inferior to alternative technologies. Work to
adopt Ethernet for use in the backplane has yielded solutions that possess
the features and performance to compete with alternative interconnect
technologies today, and also carry the promise of even better performance
in the future.


Backplane Requirements of an 
ATCA Chassis 

The ATCA architecture is based on PICMG 3.0. Among other configurations,
it calls for a 12U chassis with blades that are 8U high by 280 mm deep
with a 1.2-inch pitch. Total board power can be up to 200W with forced-air
cooling in the bottom of the chassis. Power comes from a backplane
standard that allows for distribution of 48 VDC power to all of the cards.

Multiple interconnect technologies are supported in the ATCA
specification, including Ethernet, PCI Express and InfiniBand in dual-star
or mesh interconnection. Multiple links of up to 40 Gbits/s (for total
capacity as high as 240 Terabits) are supported, with 99.999% uptime and
central office-level quality-of-service.

The backplane is divided into three zones (Figure 1). Zone 1 is for power
distribution and system management, zone 2 is for data transfer and zone 3
is for access to an optional Rear Transition Module, typically used for
additional I/O. Zone 2 is further broken down into two parts. A base
interface that is specified as switched 10/100/1000 Ethernet is used for
system management and, optionally, for datapath traffic. An optional,
higher-speed fabric interface supports a higher-throughput data path. The
fabric interface is flexible and can be configured either as full mesh or
as a dual-star or dual-dual star, depending upon the application.

The switching fabric is protocol-agnostic, thereby giving system designers
a choice by defining several sub-specifications for specific interfaces
and protocols. These include PICMG 3.1 for Ethernet, which is also used
for the base interface, PICMG 3.2 for InfiniBand, PICMG 3.3 for StarFabric
and PICMG 3.4 for PCI Express.

Dual Star Backplane Fabric
Figure 2: Dual-star topology makes use of redundant hub cards added to the
chassis that provide the backplane switching fabric. Dual-star
configurations are designed for applications with predictable traffic
flows, as well as for non-local traffic, such as multiplexing
applications.

The dual-star topology uses redundant hub cards that are added to the
chassis to provide the backplane switching fabric. Each node card has a
channel connection to the hub cards across the backplane and the hub card
is responsible for switching all data. A backplane channel connects both
hub cards to each other for redundancy. Similarly, a dual-dual star
configuration, in which hub cards are used to create two entirely separate
and redundant fabrics with fabric interfaces to each, is also supported.
This is useful in applications where extra throughput is needed.


ATCA Chassis Configurations 

The dual-star configurations are designed for applications with
predictable traffic flows (Figure 2). More bursty traffic can result in
higher latency as one node card consumes a disproportionate share of the
switch resources. Dual-star backplanes are also a good design for
non-local traffic, such as multiplexing applications. A data packet that
is switched to multiple node cards within the switch is subject to extra
latency associated with the transit back and forth to the hub card.

A switch for this application should have a good flow control mechanism to
help regulate the flows from all of the node cards to alleviate congestion
at the hub board. Additionally, a low-latency switch provides the broadest
applicability. Increasingly, applications are becoming latency sensitive,
especially as ATCA chassis are being considered for storage and computing
systems. Those applications will be severely impacted by a switch element
that is not optimized for latency.

In contrast to the dual-star approach, full mesh topology is usually used
in applications that have large data throughput needs, or that require
high levels of peer-to-peer processing from elements within the chassis.
In full mesh configuration, a backplane switch resides on each node card
and connects into a matrix of channels on the backplane. Those channels
extend to all of the other node cards in the chassis.

Mesh Backplane Fabric
Figure 3: A full mesh topology works best for applications with high data
throughput, or those requiring high levels of peer-to-peer processing from
elements within the chassis.

The mesh design (Figure 3) offers higher scalability, system redundancy
and transit efficiency. However, this comes at the cost of a larger number
of trace routes on the backplane, which can drive up the cost of this
component. Additionally, the mesh configuration typically requires
software enhancements to the applications to make them aware of the
any-to-any connectivity provided by mesh topology. The high cost of the
backplane, as well as multiple backplane switch chips on each node card,
make the full mesh appropriate for smaller systems, or systems that truly
require single-hop latency between node cards.
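
The cost trade-off between the two fabrics can be made concrete by
counting backplane channels. The sketch below is a simplified model and is
not taken from the PICMG specifications: it assumes a hypothetical 14-slot
chassis, two hub cards in the dual-star case, one channel from each node
card to each hub plus one hub-to-hub channel, and one channel between every
pair of cards in the full mesh. The function names are illustrative.

```python
def dual_star_channels(total_slots: int, hubs: int = 2) -> int:
    """Channels in a dual star: each node card to each hub, plus a hub-to-hub link."""
    node_cards = total_slots - hubs
    return node_cards * hubs + 1


def full_mesh_channels(total_slots: int) -> int:
    """Channels in a full mesh: one for every unordered pair of cards."""
    return total_slots * (total_slots - 1) // 2


slots = 14  # hypothetical chassis size, chosen only for illustration
print("dual star:", dual_star_channels(slots), "channels, node-to-node via a hub")
print("full mesh:", full_mesh_channels(slots), "channels, node-to-node in one hop")
```

In this model the mesh needs several times as many channels as the dual
star, and the gap widens quadratically with slot count, which is consistent
with the article's point that full mesh suits smaller systems or those that
truly need single-hop latency.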


For 10 Gigabit Ethernet to be used in an ATCA backplane, the signals must
be optimized for an electrical trace rather than for a copper or fiber
cable. The key 10 Gigabit physical layer technology used in the backplane
is 10 Gigabit Attachment Unit Interface (XAUI). This has become the de
facto standard for 10 Gigabit Ethernet chip-to-chip connections, and is
popular in backplane applications because it is easy to implement.

Figure 4: The FocalPoint switch chip from Fulcrum Microsystems is one of
several low-latency switches aimed at ATCA applications. It features
twenty-four 10 Gigabit Ethernet ports and a latency of 200 ns.

XAUI is a parallel interface composed of four 3.125-GHz serial lanes, with
a low pin count for fewer traces across a circuit board. Since it is
derived directly from 10 Gigabit Ethernet, it keeps many of that
technology's important features, such as 8B/10B encoding, making the
packet transition to XAUI fast.
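
The relationship between XAUI's lane rate and its 10 Gbit/s payload
follows from the 8B/10B line code, which carries 8 data bits in every 10
transmitted bits. A minimal sketch of that arithmetic (the function name
is illustrative):

```python
def payload_rate_gbps(lanes: int, lane_rate_gbaud: float,
                      data_bits: int, code_bits: int) -> float:
    """Usable data rate after line-coding overhead, summed across all lanes."""
    return lanes * lane_rate_gbaud * data_bits / code_bits


# XAUI: four serial lanes signaling at 3.125 Gbaud each, 8B/10B encoded.
raw = 4 * 3.125
usable = payload_rate_gbps(lanes=4, lane_rate_gbaud=3.125,
                           data_bits=8, code_bits=10)
print(f"raw line rate: {raw} Gbit/s, usable payload: {usable} Gbit/s")
```

Each lane therefore carries 2.5 Gbit/s of data, and the 20 percent coding
overhead is the price paid for the embedded clock and DC balance that the
encoding provides.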


The XAUI interface is easy to design with because it is self-clocked,
allowing designers to distribute the clock on their boards, and also
delivers an inherently low level of electromagnetic interference. The
interface can compensate for multi-bit bus skew, which contributes to its
relatively long-distance specification of 0.5 meters.

So even though XAUI was not made for backplane applications, it has become
popular there. The IEEE is now undertaking the creation of a
backplane-specific physical layer specification for 10 Gigabit Ethernet
that would result in a physical layer standard optimized for backplane
applications.

Figure 5: Lower latencies in 10 Gigabit Ethernet switch chips are now on
par with other backplane choices for ATCA designers, such as Fibre
Channel, Myrinet, InfiniBand and PCI Express.


802.3ap

The IEEE P802.3ap Task Force has been working on backplane Ethernet since
May 2004, with the goal of defining Ethernet operation over electrical
backplanes covering distances of up to 1 meter. This effort will combine
the IEEE 802.3 media access controller (MAC) operating at Gigabit and 10
Gigabit speeds with three new physical-layer signaling standards:
1000BASE-KX, 10GBASE-KX4 and 10GBASE-KR. The KX4 standard is based on
XAUI.

The 10 Gigabit Ethernet standard will be defined for parallel connections,
such as XAUI, and for serial connections that will operate as a single
channel at 10 Gbits/s. The goal of the specification is to incorporate as
much of the Ethernet standard as possible, including the MAC frame format
and important services encoding, such as Spanning Tree, virtual LANs
(802.1Q) and management information (802.1

One of the key drivers of the push for 802.3ap is to better define the
specifications for the serializer/deserializer (SerDes) transceiver
circuits. Tightening up these specifications will provide tighter
tolerances for the interface between the board and the backplane, as well
as incorporate new management mechanisms that will make it easier to
design and debug backplanes. The parallel standard could eventually be
used for a 40 Gbits/s


backplane, with each channel supporting 10 Gbits/s instead of today's
typical 2.5 Gbits/s.

The IEEE committee defining the standard will keep Ethernet packet sizes
the same and will support the existing media-independent interfaces. The
802.3ap standard is now in draft form and should be completed in November
of this year.

Congestion Management

The IEEE is also tackling congestion management, another issue that should
greatly ease the move toward 10 Gigabit Ethernet backplanes. The 802.3ar
study group grew out of the recognition by the backplane Ethernet
committee that moving Ethernet into these applications increases the
sensitivity to frame delay, delay variation and packet loss.

Ethernet's current flow control capabilities were designed for
environments where latency and delay variation are not key issues. The
goal of the committee is to find a new congestion management scheme that
can augment these capabilities. More specifically, the new standard will
specify mechanisms to support the communication of congestion information
and to limit the data rate on an Ethernet link to back off traffic while
congestion clears. All this is done without changing the MAC or physical
layer interfaces and while minimizing the throughput reduction in
non-congested flows.

Low-Latency Ethernet Provides a Solution Today

Although the efforts of the 802.3ap and 802.3ar groups will improve
Ethernet for ATCA, low-latency Ethernet solutions that utilize XAUI are
available now for ATCA chassis designers. These can provide the
performance necessary for the most demanding computing and communications
applications.

Several vendors have debuted low-latency switch chips that help overcome
flow control issues and make high-performance ATCA backplanes a reality,
even for latency-sensitive applications like voice and video. Switch chips
are now available with latencies as low as 200 nanoseconds (Figure 4).
This is an order of magnitude improvement over the previous generation of
Ethernet switch chips and on par with other choices for ATCA designers,
such as Fibre Channel, Myrinet, InfiniBand and PCI Express (Figure 5).

These low latencies boost the efficiency of Ethernet's existing flow
control capabilities by providing a quick feedback loop to sources so that
they back off their transmission rate until the switch is clear. This
allows the switch to maintain wire [...]. Designers can therefore use
standard, off-the-shelf technology to achieve their goals before the
standards bodies finish their work.

Ethernet for ATCA is ready for prime time today. Because of its high
performance, low latency, ubiquity, ecosystem and proven ability to
transport any type of data efficiently, it can help turn ATCA systems into
the epitome of a COTS device.

Fulcrum Microsystems
Calabasas, CA.
www.fulcrummicro.com
Achieving 10 Gigabit 
Ethernet and Beyond 


The transition to higher speed Ethernet, from 1 GbE to 10 GbE and 
beyond, is already occurring. Several technical concerns need to be 
addressed throughout the network in order to achieve these higher 


The same MAC and reconciliation sublayer are used for all 10 GbE devices,
but the PHYs for connecting to the network transmission media are
different.


is frequently used as an interim technology to bridge the gap between
Ethernet's existing speed and its next higher speed. However, it is only a
temporary solution, since more than four physical links become very
difficult to manage and are costly to implement and maintain. For example,
a standard network interface card only has enough room on its faceplate
for four connections.








Another limitation of the LAG protocol is the overhead incurred with new
links. For every link added to the existing LAG group, the overhead on
each link already in the group increases slightly. The advantages of 10
GbE are the resulting lower latency due to the elimination of the LAG
protocol and its associated overhead, as well as higher bandwidth via a
single 10 GbE link. In addition, 10 GbE offers a wide range of cable runs,
from 15 meters on coax cables to 40 kilometers on single-mode fiber (SMF).


10 GbE Applications 

Although 10 GbE products, such as switches and adapters, have existed since the standard for 10 GbE over fiber-optic transmission media (IEEE Std. 802.3ae) was ratified in 2002, certain impediments have prevented this technology from being widely adopted. With the ever-increasing need for higher bandwidth, these obstacles will be overcome within the next few years to permit 10 GbE to penetrate into volume platforms.

Figure 2: Since the original 10 GbE standard (IEEE 802.3ae) was ratified, amendments specifying a variety of new PHYs have been developed. Two new PHYs will be added in 2006, and two more in 2007.

Figure 3 (panels "Before NTA" and "After NTA"): Latency increases as processors wait for memory to move data across the network. A network traffic accelerator (NTA) can free up CPU cycles by optimizing the system's ability to process packets.

The higher speed and faster response time of 10 GbE can benefit any traffic-burdened network. This is particularly true for data-intensive applications. Examples include enterprise financial applications, database and data modeling simulations, weather forecasting, computer-aided design and manufacturing, and graphics-intensive applications such as those found in computer games and movies.








Previously, Gigabit Ethernet found its way into high-performance computing cluster applications. As costs and obstacles decline, 10 GbE will grow in this market. Some of the more controversial applications include the possibility that at 10 Gbit/s speeds, other technologies such as iSCSI will begin to make economic as well as technical sense, potentially replacing existing protocols such as Fibre Channel.

Metro area networks (MANs), as well as long distance local area network (LAN) applications, benefit greatly from the longer distances available via fiber optics. One example is the ability to place data centers where they are most advantageous in terms of cost and convenience, such as in offsite locations away from primary facilities. The longer reach of 10 GbE and its ability to map to existing SONET/SDH 10 Gbit/s protocols also make it very suitable for wide area network (WAN) and Internet points of presence applications to implement a network capable of transmitting data at terabyte-per-hour rates.
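The "terabyte-per-hour" figure is easy to sanity-check with a little arithmetic. The sketch below is only an illustration: the function name and the `efficiency` parameter are assumptions introduced here, and the default of 1.0 ignores all protocol overhead, so real links will deliver somewhat less.

```python
# Back-of-the-envelope check: how much data can a link move in one hour?

def terabytes_per_hour(line_rate_gbps: float, efficiency: float = 1.0) -> float:
    """Return decimal terabytes moved per hour at the given line rate.

    efficiency is the assumed fraction of the raw line rate that carries
    payload (1.0 means no protocol overhead at all, which is optimistic).
    """
    bits_per_hour = line_rate_gbps * 1e9 * 3600 * efficiency
    return bits_per_hour / 8 / 1e12


if __name__ == "__main__":
    for rate in (1, 10):
        print(f"{rate:>2} Gbit/s -> {terabytes_per_hour(rate):.2f} TB/hour")
```

At the raw line rate, a single 10 Gbit/s link works out to 4.5 terabytes per hour, against 0.45 terabytes per hour for 1 GbE.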


What 10 GbE Is and Is Not 

In many ways 10 GbE is a straightforward extension of previous Ethernet speeds, providing full backward compatibility to previous Ethernet generations. It retains the key Ethernet architecture, the media access controller (MAC) protocol, the Ethernet frame format and the minimum and maximum frame size.

However, there are also some key differences, involving primarily the physical layer (PHY) of the Open Systems Interconnection (OSI) layer model (Figure 1). Although the same MAC and reconciliation sublayer are used for all 10 GbE devices, the PHYs for connecting to the network transmission media are different. In addition, unlike previous generations of Ethernet speeds, 10 GbE operates only in full-duplex mode. This is because by 1999, the use of switched Ethernet had grown to the point that support for the half-duplex mode of operation was considered a burden to the development of 10 GbE.

The original 10 GbE standard (IEEE 802.3ae) specified a variety of PHYs for connection to optical multi-mode fiber (MMF) and SMF transmission media. Since its ratification, several new amendments have been developed or are currently in development. In 2004, IEEE Std. 802.3ak-2004 specified a 10GBASE-CX4 PHY for the transmission of 10 GbE signaling over 15 meters of coax cabling for short-reach applications. In 2006, two new PHYs will be added, and two more in 2007. The new PHYs address expanding market needs for 10 GbE (Figure 2).

Even though two 10 GbE PHY standards have existed for a number of years, the cost of the 10 GbE optical solutions and the short reach of the 10 GbE copper solution have limited widespread adoption of the technology. The cost and power of the technology matter, but the lack of a standard that allows 10 GbE to operate over unshielded twisted pair (UTP) copper cabling relegates 10 GbE only to installations where it is essential. Ratification of the 10GBASE-T standard (IEEE P802.3an), which addresses a UTP solution, is targeted for June 2006. Once this occurs, lower cost 10 GbE copper PHYs will begin to appear.

Figure 4: Tehuti Networks 10 GbE network traffic accelerator (NTA) chips, shown here mounted on a network interface card, reduce latency.

Some common misperceptions are associated with 10GBASE-T. Confusion reigns about the support of auto-negotiation, but the 10GBASE-T standard does, indeed, support auto-negotiation. The auto-negotiation feature will permit implementers to design PHY devices that operate over a broader range of speeds and media. Although auto-negotiation will provide backward compatibility with lower speeds, it is unlikely that a 10GBASE-T system will be able to support operation at 10 Mbit/s due to the requirements of the transformer.
Another common misperception concerns support for Category 5 (CAT-5) cabling. CAT-5 and CAT-5e cabling are not listed as supported media in IEEE P802.3an, since 10GBASE-T performance requirements far exceed the cabling's specified performance. The 10GBASE-T draft calls for the support of CAT-6, CAT-6 Augmented (CAT-6A) and CAT-7 cabling. The Telecommunications Industry Association's Telecommunications Systems Bulletin (TSB)-155 describes the requirements an existing cabling installation must meet to transport 10GBASE-T signaling. Existing CAT-6 installations that meet TSB-155's requirements should support a reach of up to 55 m. CAT-6A and CAT-7 support a reach of up to 100 m. The primary difference between CAT-6A and CAT-7 is that CAT-6A is unshielded cable, whereas CAT-7 is shielded.

At the board level, the optimal interconnect will be primarily determined by the reach required. 10GBASE-CX4 provides a low-cost, higher performance solution for short-haul applications. It is being used as a high-performance interconnect for the clients of client-server or client-workstations involved in data-intensive applications, as well as a replacement for proprietary fabrics in storage area networks.

For networks that require longer-haul distances, such as MANs and WANs, 10 GbE offers the ability to reach up to 40 km. This provides a large potential benefit in terms of much faster and more efficient access for applications such as telecommuting, video conferencing, data mining and online research. In WAN applications, 10 GbE allows the transport of terabytes of data in world record-breaking times, linking communities around the globe. The ability of 10 GbE to effectively attach directly to the SONET/SDH core network or to be transported over unused wavelengths minimizes or eliminates the previous need for protocol translation in order to achieve global connectivity.


What is Needed to Achieve 10 GbE Speeds?

Implementing 10 GbE to achieve true 10 Gbit/s speeds requires more than simply choosing the right medium. Much of 10 GbE system throughput must occur in both the infrastructure, including routers and switches, and in the endpoints, which are primarily servers, appliances and networking storage equipment.

To meet these demands, several technical and economic concerns must be met. Increasingly, higher performance, more expensive systems are required to keep pace with network throughput. However, because of design constraints the memory subsystem cannot keep up physically.

This means that latency is increasing as processors wait through hundreds of idle cycles while memory works at its slower rate to move data across the network to the waiting applications. Both processors and memory subsystems are literally being flooded by the network bandwidth, so the industry has been seeking ways to alleviate this bottleneck. This disparity is impacting the performance of today's systems that are designed to handle critical real-time transactions and bandwidth-intensive applications, such as e-commerce, medical imaging and data warehousing.
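To get a feel for how hard a saturated 10 GbE link presses on the host, it helps to count frames. The sketch below assumes standard Ethernet framing overhead on the wire (8 bytes of preamble and start-of-frame delimiter plus a 12-byte inter-frame gap) and a hypothetical 3 GHz processor; the function names and the clock rate are illustrative assumptions, not figures from the article.

```python
# Rough frame-rate and CPU-cycle budget for a saturated Ethernet link.

PREAMBLE_AND_SFD = 8   # bytes transmitted ahead of every frame
INTERFRAME_GAP = 12    # minimum idle gap between frames, in byte times


def frames_per_second(frame_bytes: int, line_rate_bps: float = 10e9) -> float:
    """Frames per second when frames of this size arrive back to back."""
    wire_bits = (frame_bytes + PREAMBLE_AND_SFD + INTERFRAME_GAP) * 8
    return line_rate_bps / wire_bits


def cycles_per_frame(frame_bytes: int, cpu_hz: float,
                     line_rate_bps: float = 10e9) -> float:
    """CPU clock cycles available per frame if one core handles every frame."""
    return cpu_hz / frames_per_second(frame_bytes, line_rate_bps)


if __name__ == "__main__":
    cpu_hz = 3e9  # assumed clock rate, for illustration only
    for size in (64, 1518):  # minimum and maximum standard frame sizes
        print(f"{size:>4}-byte frames: {frames_per_second(size):,.0f} frames/s, "
              f"{cycles_per_frame(size, cpu_hz):,.1f} cycles per frame")
```

With minimum-size frames the budget is only a couple of hundred cycles per frame, so a processor that stalls for hundreds of cycles on a single memory access has very little headroom left.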
One of the most effective approaches to solving this bottleneck problem is the use of a TCP/IP accelerator solution. A TCP/IP accelerator, or network traffic accelerator (Figure 3), such as the chip offered by Tehuti Networks (Figure 4), reduces flooding by optimizing the system's ability to process packets. Implemented from the wire to the application, accelerator technology dramatically speeds up enterprise platform-to-network communications by reducing latency, redistributing functions to components that will do each task better and using additional enhancements to improve server system throughput. This allows the servers to maintain an optimal balance of performance and low power while requiring only minimal board real estate, thereby reducing the equipment's total cost of ownership.

In addition, interfaces are an important consideration. For example, the 64-bit PCI bus in the current Intel server architecture already pumps out data in the multi-gigabit range, whereas the new PCI Express architecture will easily scale to handle 10 GbE.
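The interface comparison can be made concrete with peak-bandwidth arithmetic. The sketch below is an illustration under stated assumptions: the article does not say which 64-bit PCI variant it has in mind, so the 66 MHz and 133 MHz (PCI-X) clock rates are examples chosen here, and the PCI Express figures assume first-generation 2.5 GT/s lanes with 8b/10b encoding. All values are theoretical peaks, and a parallel PCI bus is shared among the devices on it.

```python
# Theoretical peak bandwidth of host interfaces versus a 10 Gbit/s link.

def parallel_bus_gbps(width_bits: int, clock_mhz: float) -> float:
    """Peak rate of a parallel bus moving width_bits on every clock."""
    return width_bits * clock_mhz * 1e6 / 1e9


def pcie_gen1_gbps(lanes: int) -> float:
    """Peak per-direction data rate of first-generation PCI Express.

    Each lane signals at 2.5 GT/s; 8b/10b encoding leaves 8 data bits
    out of every 10 bits on the wire.
    """
    return lanes * 2.5 * 8 / 10


if __name__ == "__main__":
    candidates = {
        "64-bit PCI at 66 MHz (assumed variant)": parallel_bus_gbps(64, 66),
        "64-bit PCI-X at 133 MHz (assumed variant)": parallel_bus_gbps(64, 133),
        "PCI Express x4, gen 1": pcie_gen1_gbps(4),
        "PCI Express x8, gen 1": pcie_gen1_gbps(8),
    }
    for name, gbps in candidates.items():
        verdict = "enough" if gbps >= 10 else "not enough"
        print(f"{name}: {gbps:.2f} Gbit/s peak, {verdict} for 10 GbE line rate")
```

Under these assumptions the parallel buses stay in the multi-gigabit range below 10 Gbit/s, while an eight-lane PCI Express link has headroom in each direction.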


What Comes After 10 GbE? 

Although the industry is in the throes of preparing for the ramp of 10 GbE into high volumes, history reveals that it is already time to begin thinking about how a higher speed Ethernet will come to fruition. The concerns discussed above will only be further exacerbated, and new concerns added, as the market looks toward the next speed generation after 10 GbE.

Some predictions state that by 2012, the next higher speed Ethernet adoption could mirror that of 10 GbE and approach a 45% share of primary interconnect in the Top 500 supercomputing sites. It is anticipated that in July 2006 there will be a Call for Interest (CFI) in IEEE 802.3 for a Higher Speed Study Group (HSSG), which could result in the ratification of the next higher speed Ethernet standard some time around 2010.

10 GbE shares many similarities with previous generations of Ethernet, but there are also some key differences. In the 10 GbE standard itself, many of the differences are related to the transmission media. However, in order to achieve maximum 10 GbE performance, the system must be configured to support 10 GbE throughout the network. This includes improving TCP/IP processing via hardware assist so the CPU has more cycles for application processing, as well as making sure that the right interfaces, such as PCI Express, are in place. It also includes chipsets with memory subsystems that do not limit system performance. Many of those issues are being addressed today. In the very near future, the market can look forward to enjoying complete 10 GbE performance and all of its benefits.
FPGAs: The New Matrix for Design


Changing Horses in Midstream: 
Partial Reconfiguration 
for FPGA Designs 


The ability to leverage partial reconfiguration for programmable logic opens 
new doors to a whole host of applications such as software defined radio, 
dynamic instruction set computing and automatic target recognition. 


by Mark Goosman
Xilinx






























Current trends in logic design such as shorter product lifecycles, greater design complexity and increased scaling (e.g., "Moore's Law") are adding increasing pressure for faster design speed and lower power in a smaller physical space. Although today's advanced FPGAs are rapidly evolving to address these issues of speed, power and size, new technologies in the area of partial reconfiguration offer the promise of even greater advances.

Partial reconfiguration is a design process that allows a limited, predefined portion of an FPGA to be reconfigured while the remainder of the device continues to operate. This is especially valuable where devices operate in a mission-critical environment and cannot be disrupted while subsystems are redefined. The ability to partially reconfigure a device takes the already powerful benefits of reprogrammability to a much higher level.

The obvious benefit of reconfigurable devices, such as FPGAs, is that the functionality with which a device is configured can be changed and updated at some time in the future. As additional functionality or design improvements are made available, the FPGA can be shut down, completely reprogrammed with new logic and operations can be resumed. Partial reconfigurability addresses the environment where logic needs to be changed or updated within a part of an FPGA without disrupting the entire system. This may be a design comprised of several blocks of logic that requires an update of the functionality within one block, without disrupting the system and stopping the flow of data.

Using partial reconfiguration, designers can dramatically increase the functionality of a single FPGA, allowing for fewer, smaller devices than would otherwise be needed. This allows for additional functionality, lower power, reduced cost and less physical space on the board.

Partial reconfiguration is useful for systems with multiple functions that can time-share the same FPGA device resources. In such systems, one section of the FPGA continues to operate while other sections of the FPGA are disabled and reconfigured to provide new functionality. This allows concurrent support for multiple independent applications in a single FPGA. This is somewhat analogous to dynamic task switching or multitasking of a general-purpose processor. Without this capability it would be necessary to reconfigure the entire FPGA to support a different application, which would result in the loss of all previous applications.

Partial reconfiguration provides an advantage over multiple full bitstreams in applications that require continuous operation, which is not otherwise possible during full reconfiguration. One example is a graphics display that utilizes horizontal and vertical synchronization. Because of the environment in which this application operates, signals from radio and video links need to be preserved, but the format and data processing format may require updates and changes during operation. With partial reconfiguration, the system can maintain these real-time links while other modules within the FPGA are changed on-the-fly.

Implementing partial reconfiguration first requires an FPGA that inherently supports the dynamic reconfiguration of only portions of the device, while leaving the other portions unaffected. Then a set of software development tools is needed that supports the development of applications restricted to boundaries that comply with the hardware architecture of the FPGA. Finally, some form of basic controller must be available to dynamically manage the reconfiguration of the FPGA. This could be an embedded general-purpose processor (GPP), a soft core GPP, or an external GPP connected to the FPGA. In this shared resources model, the same embedded GPP that is running the design infrastructure and operating environment is also managing the partial reconfiguration of the FPGA.

Figure 1: All static logic is grouped in a single Pblock, AG_base.

In an FPGA, all user programmable features are controlled by memory cells that are volatile and must be configured on power-up. These memory cells are known as the configuration memory, and define the look-up table (LUT) equations, signal routing, input/output block (IOB) voltage standards and all other aspects of the design.
Figure 2: The hierarchy for a partial reconfiguration design.

To program configuration memory, instructions for the configuration control logic and data for the configuration memory are provided in the form of a bitstream, which is delivered to the device through the JTAG, SelectMAP, serial, or ICAP configuration interface.

Typically, a user performs the initial programming by downloading an entire bitstream to an inactive target device. Using partial reconfiguration, a subset of the FPGA can be reprogrammed using a partial bitstream. You can use a partial bitstream to change the structure of one part of an FPGA design as the rest of the active device continues to operate.


A Methodology for Partial Reconfiguration

Successful implementation of a design using a partially reconfigurable flow requires following a strict design methodology. A reconfigurable design will consist of partially reconfigurable modules (PRMs) that will be swapped in and out of the FPGA and the static logic, which will remain in place. The general picture of the design flow involves the need to insert bus macros between the PRMs and the rest of the design, the static or fixed logic that remains in place. Bus macros are the channels or ports through which modules communicate and pass data. This provides a fixed communication channel for the static logic regardless of the reconfigurable logic on the other side.

Successful design requires following the guidelines of the synthesis tools to generate a partially reconfigurable netlist. The synthesis tool must be configured so that no optimizations occur across hierarchical boundaries. This is generally done with a KEEP_HIERARCHY or similar directive.

The next step is to use the tool to floorplan the PRM and cluster all static modules together and then place the bus macros between the PRMs and the static logic following PRM-specific design rules. Finally, run the partial reconfiguration implementation flow.

PlanAhead 8.1 from Xilinx is an example of a single environment (or platform) used to manage the preceding guidelines, which can be broken down into the following steps:

1. Netlist import
2. Floorplanning the design for partial reconfiguration
3. Design rule checks
4. Netlist export
5. Implementation flow management
6. Bitstream size estimation





Although these steps are straightforward, the methodology requires meticulous implementation in order to ensure success. Changing out a portion of a complex, high-speed design does not allow much margin for error.

Use a tool like PlanAhead that works with imported netlists, such as those from XST or Synplify, to import any hierarchical netlist (single edf/ngc or multiple edf/ngc files). Then follow the regular guidelines to import the design into the tool and create a floorplan as you would with any non-partially reconfigurable design.

Floorplanning for partial reconfiguration is an important step in the partial reconfiguration flow. Floorplanning is based on design partitions referred to as physical blocks, or Pblocks. A Pblock can have an area (such as a rectangle) defined on the FPGA device to constrain the logic. The designer can define Pblocks without rectangles and the implementation software will attempt to group the logic during placement. Netlist logic placed inside of Pblocks will receive AREA_GROUP constraints.

Floorplanning for partial reconfiguration entails several key subtasks. The first subtask is to assign an area for the PRM by creating a Pblock with an area defined within the fabric. This includes assigning the RANGE values for the Pblock. The MODE constraint must be defined for all reconfigurable regions (MODE=RECONFIG). This constraint prevents the implementation tools from failing with unexpanded block errors during implementation of the static and reconfigurable modules.

Every top-level module, other than PRMs, should be grouped together in a single Pblock. This is called a static logic block. This block should not have a RANGE defined; this will cluster the static logic together in a single Pblock. Select all top-level modules (except the PRMs) and assign them to a Pblock. Figure 1 shows the static logic grouped in a Pblock named AG_base. When the floorplanning is completed in the design tools, the resulting physical hierarchy will be organized as shown in Figure 2.
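The floorplanning constraints described above can be pictured as a handful of lines in a constraints file. The sketch below is only an illustration: it emits text in the general UCF AREA_GROUP form as commonly written for Xilinx tools, and that exact syntax is an assumption to be checked against the tool documentation. The helper name, instance names and slice coordinates are all hypothetical. A reconfigurable Pblock gets a RANGE and MODE = RECONFIG; the static Pblock gets neither.

```python
# Emit AREA_GROUP-style constraint lines for Pblocks (illustrative sketch).

def pblock_ucf(pblock, instances, slice_range=None, reconfigurable=False):
    """Return constraint lines that assign instances to one Pblock.

    slice_range is left as None for the static logic block, which should
    not have a RANGE; reconfigurable=True adds the MODE constraint.
    """
    lines = [f'INST "{inst}" AREA_GROUP = "{pblock}";' for inst in instances]
    if slice_range is not None:
        lines.append(f'AREA_GROUP "{pblock}" RANGE = {slice_range};')
    if reconfigurable:
        lines.append(f'AREA_GROUP "{pblock}" MODE = RECONFIG;')
    return lines


if __name__ == "__main__":
    # Hypothetical PRM with an assigned area on the fabric.
    prm = pblock_ucf("AG_prm_a", ["prm_a"],
                     slice_range="SLICE_X0Y0:SLICE_X15Y31",
                     reconfigurable=True)
    # Static logic: every other top-level module, no RANGE.
    static = pblock_ucf("AG_base", ["clk_gen", "io_ctrl"])
    print("\n".join(prm + static))
```

In practice the floorplanning tool writes these constraints itself; the point of the sketch is only to show which Pblock carries which constraint.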

The next step is to place the bus macros. Bus macros are physical ports that connect a PRM to static logic. Any connection from a PRM to static logic should always go through a bus macro. Bus macros are instantiated as black boxes in RTL and are filled with a predefined routing macro in the form of an .nmc file. Bus macros are placed on the PRM boundary. Static logic connected to PRMs will migrate toward the bus macro during placement.

Given the complexity of the flow, it is very common for mistakes to be introduced in the original RTL and during the floorplanning process. Any tool worth its salt will check for design violations. Also integrated into this feature, in the case of PlanAhead, is the PR-Advisor, which provides feedback on how to improve your design. There are a number of design rule checks that are specific to partial reconfiguration.

The bus macro DRC provides verification for all design rules related to bus macro connectivity and placement. One example of a bus macro DRC is the PREP check. This DRC checks for all rules that should be followed for bus macro placement. Figure 3 shows an example of a design that failed the PREP DRC. In this case, the tool tells us that the interleaved/nested macro should be placed at SLICE_XALY.

Figure 3: The PREP DRC verifies all rules that should be followed for bus macro placement.

The floorplanning DRC covers floorplanning rules. Clock objects (global clock buffers, DCMs) and I/Os should be placed and static logic clustered. The glitching logic DRC verifies glitching logic elements (SRL and distributed RAM) above and below PRM regions.

Another DRC is the timing advisor/DRC. This provides a check for timing-related issues. One example of a timing DRC is the PRTB check. With the PRTB check, the static module is implemented before the PRM during the implementation phase. Regular timing constraints do not cover the paths that cross between the static and a PRM module. This does not present a problem, provided that the bus macro is synchronous. However, if it is an asynchronous bus macro, the static module does not know about the propagation of asynchronous paths, as shown in the example in Figure 4. This could be important if these paths are timing-critical. One way to pass this information to the static module is to specify a TPSYNC constraint on the bus macro output net. PlanAhead software will recommend a TPSYNC constraint that can be added to the UCF file.

Once the design is floorplanned and passes the DRC checks, it is ready to be exported. The design tools should take care of exporting the original hierarchical netlist into a PR-style netlist that has a specific format (static and PRM in separate directories). The export directory will appear as shown in Figure 5. Next, the partial reconfiguration flow wizard, shown in Figure 6, runs the partial reconfiguration implementation on the exported design. It will produce a full bitstream for the complete design and a partial bitstream for each PRM. The flow includes assembly and bitstream generation (results are stored in the merge directory).

Figure 5: Upon successful completion of the floorplanning DRC processes, all modules will be displayed in black text within the Export Floorplan dialog box.

The Pblock statistics report includes a section that reports PRM bitstream size (Figure 6). This information can be used for estimating the size of configuration memory storage such as external flash and DDR. This information can also be used to calculate how long it will take to swap the module based on your bitstream memory interface.

Partial reconfiguration offers a tremendous opportunity for designers looking for a way to increase the functionality of their design, achieve lower power and reduce the number of devices on their board. Using new design tools and techniques becoming available for partial reconfiguration applications can greatly simplify the complexities of juggling the dynamic operating environment of these cutting-edge applications, allowing a single device to operate in applications that previously required multiple FPGAs along with the required power, board space and design overhead.

Xilinx
San Jose, CA.
(408) 859-7778.
[www.xilinx.com].

Figure 6: The display of Pblock properties includes the estimated size of the bitstream.
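The swap-time estimate mentioned above is simple division once the partial bitstream size is known. The sketch below assumes the configuration interface accepts one word of the given width on every clock cycle and ignores controller overhead; the function name, bitstream size, interface widths and clock rates are hypothetical values chosen for illustration, not figures from the article or from a device data sheet.

```python
# Estimate how long it takes to load a partial bitstream (illustrative sketch).

def swap_time_ms(bitstream_bytes: int, interface_width_bits: int,
                 clock_mhz: float) -> float:
    """Milliseconds to shift a bitstream through a configuration interface.

    Assumes one interface-width word is accepted on every clock cycle.
    """
    bytes_per_second = (interface_width_bits / 8) * clock_mhz * 1e6
    return bitstream_bytes / bytes_per_second * 1e3


if __name__ == "__main__":
    partial_bitstream = 200 * 1024  # hypothetical 200 KB PRM bitstream
    for width, mhz in ((8, 50), (32, 100)):
        t = swap_time_ms(partial_bitstream, width, mhz)
        print(f"{width}-bit interface at {mhz} MHz: {t:.3f} ms")
```

The same bitstream size figure, multiplied by the number of PRM variants to be stored, gives a first estimate of the external flash or DDR capacity needed.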
Reduce Cost, Risk and
Time-to-Market with an
FPGA-to-Structured ASIC
Strategy


The costs, development times and risks of ASIC designs have become all but pro- 
hibitive. Finalizing a design on an FPGA and moving it to a structured ASIC can cut 
cost, time and risk as well as result in smaller size and lower power consumption. 


by Danny Biran 
Altera 


Few would argue that ASICs and ASSPs have become viable vehicles for only a few silicon vendors. As process nodes continue to shrink, to 90 nm now and to 65 nm in the near future, not many vendors are willing to take on the risk associated with an ASIC design unless they have a very high level of confidence that their chip will be sold in huge volumes to justify an investment that is often tens of millions of dollars (Figure 1). ASICs are under pressure. It is increasingly difficult for companies to justify the development of a new chip with the very high NRE price tags. If chip development cost is $30M, R&D costs are 20% of revenue, and one expects 10% market share, only a $1.5 billion market opportunity can justify the expense. There aren't many markets of this size that can be serviced with a single product.

Figure 1: The cost of chip development goes up dramatically as technology nodes shrink.
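The market-size figure follows from two divisions, which the short sketch below spells out. The function and parameter names are introduced here for illustration; the input values are the ones quoted in the paragraph above.

```python
# How large must a market be to justify a given chip development cost?

def required_market_size(development_cost: float, rd_fraction_of_revenue: float,
                         market_share: float) -> float:
    """Total market size needed for the development cost to fit the R&D budget.

    Revenue must be large enough that the R&D fraction covers the
    development cost, and the market must be large enough that the
    expected share yields that revenue.
    """
    required_revenue = development_cost / rd_fraction_of_revenue
    return required_revenue / market_share


if __name__ == "__main__":
    market = required_market_size(30e6, 0.20, 0.10)
    print(f"Required market: ${market / 1e9:.1f} billion")
```

A $30M development cost at 20% of revenue implies $150M in revenue, and a 10% share of the market implies a $1.5 billion market.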

One of the risks associated with building an ASIC is that its functionality is fixed during fabrication, resulting in the need to "get it right the first time," which can often be hard to achieve. The cost of redesign for even a small portion of an ASIC that doesn't work may be prohibitive in terms of both time and money. To mitigate this risk, FPGAs are often used as prototyping vehicles. With their high levels of logic density and performance, high-density FPGAs help developers implement complex designs that can be tested in the target system. FPGA prototypes offer many distinct advantages, allowing a design to have its hardware, application software and firmware fully developed and tested at full speed without a multi-million-dollar price tag.





Moving to a Structured ASIC

Once the design is prototyped in the FPGA, the next step is to move to an effective silicon platform. Migrating from an FPGA to an ASIC is tricky and time-consuming, since there is generally a good deal of work required to make sure the original design will still function correctly when implemented as an ASIC.

A better solution is to move the verified FPGA design onto a structured ASIC, which has the characteristics of an FPGA and most attributes of an ASIC, and is significantly smaller than an FPGA. A structured ASIC has near-ASIC performance and power consumption, along with a shorter development cycle and lower NRE than an ASIC. However, it has a much lower unit cost than a comparable FPGA, as much as one-tenth the cost.

Structured ASICs are comprised of prefabricated base arrays with predefined and verified logic, memory, clock networks and I/O resources. The arrays are processed through manufacturing up to a certain point and then "banked" for future customization. Application-specific designs are then configured onto the base arrays using a few top metal layers, thus creating the structured ASIC. This gives a structured ASIC a post-fabrication configurability dimension that is not available with an ASIC. According to Gartner Group, structured ASICs are rapidly gaining in popularity with revenues projected to grow from $99 million in 200 to $848 million by the end of 2007.

Designers should be aware that different structured ASIC vendors implement their architectures in very different ways, even with different families from the same vendor. Parameters such as number of user-definable layers, number of equivalent ASIC gates, memory capacity, maximum clock rate and target processes are very different from vendor to vendor, making the migration from the FPGA of one vendor to the structured ASIC of another, while easier than to an ASIC, still difficult and fraught with potential errors that can add time and cost to reaching production silicon.


The Key for Structured ASIC Success 

If the same silicon vendor produces both silicon platforms, moving a design from a system-verified FPGA to a structured ASIC, while maintaining functionality and meeting timing constraints, is greatly simplified and provides the customer with a low-cost and safe "path to production" strategy. A one-to-one mapping between the FPGA and structured ASIC can eliminate redesigning the board, and redeveloping and revalidating the system, resulting in significant development cost savings and time-to-market benefits. Proven IP cores, common to both platforms, a single design flow, common EDA tools and pin-to-pin compatibility between the FPGA and the structured ASIC assure that the smaller, less expensive structured ASIC will work correctly in the application.

The costs associated with a structured ASIC are much lower than those for an ASIC. For a 90-nm design with, say, 2.2 million ASIC gates, 9 Mbits of SRAM and 1.5 million gates for DSP and multipliers, NREs are often as low as $225,000 to $300,000,
Figure 2: This Adaptive Logic Module (ALM) is the basic logic block of the Stratix II FPGA. Using ALMs instead of traditional 4-input look-up tables (LUTs) increases logic utilization efficiency and performance.

set for an ASIC. Designs prototyped and verified in an FPGA, and then migrated to a structured ASIC can be used for all but the very highest performance and/or cost sensitive applications that would justify the cost optimization realized by a full ASIC.

When migrating from an FPGA to a structured ASIC, the vendor can eliminate a lot of the circuitry from the FPGA that is not required for normal chip operation. The removed circuitry includes FPGA configuration logic, programmable routing and logic and memory programmability. What needs to be added to the structured ASIC is embedded testability, since the circuitry needed to test the structured ASIC is much different than that for the FPGA. Eliminating all the extra transistors results in a much smaller chip with a corresponding unit cost reduction of as much as 90%. The structured ASIC also sees significant power reduction. The shorter interconnect paths result in lower dynamic power dissipation, and the structured ASIC has lower static power consumption with the elimination of the many transistors used for configuration and programming on the FPGA.
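The cost argument above can be turned into a rough break-even estimate. The sketch below is only illustrative: the NRE range and the 90% unit cost reduction come from the text, while the $500 FPGA unit price and the function name `break_even_units` are assumptions made for the example.

```python
def break_even_units(nre_dollars, fpga_unit_cost_dollars, reduction_percent):
    """Number of units after which per-unit savings repay the structured-ASIC NRE.

    reduction_percent is the unit cost reduction relative to the FPGA (1..100).
    """
    if not 0 < reduction_percent <= 100:
        raise ValueError("reduction_percent must be in 1..100")
    # Saving per unit, kept in whole cents so the division stays in integers.
    saving_cents = fpga_unit_cost_dollars * reduction_percent
    nre_cents = nre_dollars * 100
    return -(-nre_cents // saving_cents)  # ceiling division


# Hypothetical $500 FPGA; NRE range and 90% reduction as quoted in the text.
low = break_even_units(225_000, 500, 90)
high = break_even_units(300_000, 500, 90)
print(low, high)
```

With a cheaper FPGA or a smaller cost reduction the break-even volume rises accordingly, which is why the migration mainly pays off for designs headed to volume production.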

A key to successful FPGA and structured ASIC migration, even when both silicon platforms are from the same vendor, is how well the basic logic building blocks of the structured ASIC implement the logical functions of the FPGA. This is not a simple task, and requires a lot of work on the part of the vendor who has developed both platforms.

For example, the logic structure of the Stratix II FPGA architecture comprises basic logic units known as adaptive logic modules (ALMs). As shown in Figure 2, each ALM contains a variety of look-up table (LUT)-based resources, two full adders, carry-chain segments, two flip-flops and many additional logic enhancements that can be divided into two adaptive LUTs (ALUTs). One ALM can implement logic functions with up to seven inputs and complex logic-arithmetic functions, increasing logic efficiency and reducing routing resources.

The HardCopy II structured ASIC family comprises an array of fine-grained structured cells called HCells that are grouped into HCell macros to implement a portion of an ALM or a section [...] library containing a pre-verified, pre-characterized HCell macro for every ALM configuration, which then maps the ALMs into a structured ASIC design. The tool only maps the utilized portion of each ALM to HCell macros (Figure 3); if parts of an ALM are not used in the FPGA design, then they are not mapped to the HardCopy II device, yielding a more efficient mapping of the prototyped design.

Figure 3: If sections of an ALM are not used in the FPGA design, then they are not mapped to the structured ASIC. This increases the efficiency of the FPGA-to-structured ASIC operation.

When compared to a corresponding Stratix II device, a HardCopy II structured ASIC is 50% faster, dissipates up to TK lower core power and has a 60-85% smaller chip size.

A HardCopy structured ASIC also offers time-to-market advantages. The typical time it takes an ASIC/ASSP to go from design initiation to completion is around two years. During that time, market opportunity for the chip can be severely hindered or it can even disappear. Taking a design to a structured ASIC typically takes half that time or less, allowing the product to reach the market significantly sooner than if it were implemented as an ASIC or ASSP.

The secret to making FPGA design an integral part of product development beyond using the FPGA only as a prototype device is to provide a clear and low-risk path from the FPGA to a production-viable silicon platform, such as a structured ASIC. With near-ASIC performance and cost, such [...] that are currently filled by expensive and unreliable (in terms of cost and development schedule) ASICs and ASSPs. However, with so many different structured ASIC architectures and business models available, an important factor in successful FPGA-to-structured ASIC migration is re-mapping the logic functions of one platform to the other as opposed to doing an expensive and risky architectural conversion. This is accomplished by working with a vendor who has developed both the FPGA and structured ASIC architectures, along with the EDA tools a designer uses to develop and verify the target design on the FPGA and then migrate the design to a structured ASIC.
Moving Real-Time Data around
FPGA-Centric Systems


Support for real-time FPGA networks is emerging from board level vendors. As larger FPGAs and solutions that are more sophisticated are required, this will become increasingly important.


by Jeremy Banks
VMetro


Computationally intensive applications often involve real-time high-bandwidth data streams coupled with fast signal processing. This increasingly involves using multiple FPGAs because they are faster than CPUs or DSPs. However, coupling FPGAs (or IP cores) can be difficult and can consume unnecessarily large amounts of FPGA resources, leaving fewer resources for the actual signal processing. Since the data movements can occur in parallel, the network communications can also be complex (Figure 1). The ability to establish links between FPGA IP cores, no matter where they are physically located, is important.

The trend with CPU data communications is to move away from parallel bus structures in favor of multiple serial point-to-point data links. This makes systems easier to build and improves system performance because CPUs do not have to share a bus; instead there are dedicated communication paths that optimize the system communication. This also improves determinism by reducing the data traffic and the number of data sources on a link, an essential requisite for real-time solutions.

The most popular serial fabrics used for point-to-point communications include PCI Express, Serial RapidIO, Gigabit Ethernet and InfiniBand. High-speed serial communication (HSSC) using these fabrics is becoming the backbone of modular high-performance processing solutions through the adoption of standards such as VXS (VITA 41) and VPX (VITA 46) (Figure 2). Boards and systems built using HSSC are providing high-density, tightly coupled FPGA and CPU solutions with high-bandwidth I/O. Such solutions are well suited to real-time applications.











Protocol-rich HSSC fabrics work well for CPU-centric systems, but for FPGAs, these fabrics are a luxury that sacrifices resources to implement complex communications. For systems that include both FPGAs and CPUs, using CPU-centric fabrics at the CPU-FPGA boundary makes sense. However, for direct FPGA-to-FPGA communications an alternative is required in both the type of fabric and how it is used. After all, an FPGA is not a CPU, and treating it as such will dilute the true benefits offered by FPGAs.

Figure 1: Data flow around networks of FPGAs can be complex.


Data Flow 

A common way to design a system in its early conceptual stages is to draw a block diagram. In doing this, the designer is describing the major processing blocks and how data moves through them. Memory mapping is a concept that CPUs use to arrange and process their data. Memory-mapped fabrics, such as PCI Express, fit well with CPU-centric processing models. Implementing the conceptual design onto a CPU requires "converting" the block diagram into the CPU's memory-mapped model.

By contrast, an FPGA can be used to lay out the block diagram "as is": parallel flows of data can be the same as the block diagram, as can the processing blocks. Along with increased performance, this difference of one-to-one mapping is a key advantage [...] communication. However, implementing the block diagram and its data flows on an FPGA presents some practical limitations. Even assuming all the processing blocks (or IP) are available, there is still the issue of whether all of this functionality can be fitted onto a single FPGA device. If not, can the parallel and independent data flows between processing blocks on different FPGAs be maintained and operate as if they are processing blocks on the same device?

Figure 2: Example VXS board-based solution linking multiple FPGAs together over HSSC links, and examples of board level components: a VXS switch, VXS FPGA and FPGA PMC with HSSC links.

Imposing CPU-style fabrics to manage the data flow would be a step backward because it would mean handling the data in a different way than that needed by the FPGA and would no longer represent a "flow." However, simple real-time communications networks based on data flows are becoming available. In their simplest form, these real-time networks need reliable data links and a tag to describe which processing block the data is to be delivered to and, ideally, what the data represents. Dedicated real-time communications network IP cores can handle all of this work without having to implement memory-mapped fabrics.
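The tag-based delivery described above can be modeled in a few lines of software. This is a minimal sketch, not any vendor's implementation: the `Frame` fields and the `FlowRouter` class are hypothetical names chosen for illustration, and a real network core would do this in logic rather than Python.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class Frame:
    dest_tag: int    # which processing block should receive the data
    data_type: str   # what the data represents (hypothetical label)
    payload: bytes


class FlowRouter:
    """Delivers frames to processing blocks by tag, with no memory map involved."""

    def __init__(self) -> None:
        self._blocks: Dict[int, Callable[[Frame], None]] = {}

    def attach(self, tag: int, block: Callable[[Frame], None]) -> None:
        # Register a processing block as the consumer for one tag.
        self._blocks[tag] = block

    def deliver(self, frame: Frame) -> bool:
        # Returns False when no block is attached for the frame's tag.
        block = self._blocks.get(frame.dest_tag)
        if block is None:
            return False
        block(frame)
        return True


fir_input = []
router = FlowRouter()
router.attach(3, fir_input.append)
router.deliver(Frame(dest_tag=3, data_type="iq_samples", payload=b"\x01\x02"))
```

The point of the sketch is that the sender only needs to know the destination tag, not an address inside the receiving device.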


Real-Time Communications Networks 

Real-time systems are deterministic: for a given event, any event, a real-time system must respond within a guaranteed period. This is a common statement dictating what a real-time, deterministic system has to be able to achieve. When FPGAs are used for processing because of their speed, the ability of the network to deliver data within guaranteed constraints is even more important. Why use a device for performance if you cannot get data to it in the first place? For a system to be deterministic, the communications network must also be deterministic. This can be further complicated if there is a need to provide support for multiple data streams across a few links. This is a potential bottleneck and threat to real-time performance.

What makes a network deterministic? If the network can reliably deliver its complete payload within a guaranteed time period, no matter what is happening elsewhere on the network, it can be considered deterministic. For dedicated point-to-point connections, this is straightforward. However, what about packet-switched connections where the data hops through many devices before arriving at its destination (Figure 3)? Or, what about situations where multiple data streams must share physical links?

Non-blocking switches are part of the answer, but there is no substitute for determinism by design, a higher-level system concept. One way to do this is to ensure that all data paths through the system are known (ideally fixed) with dedicated, allocated and guaranteed bandwidth all the way through the system. If switches are used, they should be non-blocking on all data channels.
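Determinism by design lends itself to a simple offline check: with fixed paths and allocated bandwidth, every link's total allocation must fit within its capacity. The sketch below assumes a hypothetical description format (link names, rates in Mbit/s, and streams listed with their fixed paths); the numbers are invented for illustration.

```python
def oversubscribed_links(link_capacity, streams):
    """Return {link: allocated_rate} for every link whose allocation exceeds capacity.

    link_capacity: {link_name: capacity}
    streams: iterable of (stream_name, rate, [link_name, ...]) with fixed paths.
    """
    load = {link: 0 for link in link_capacity}
    for _name, rate, path in streams:
        for link in path:
            load[link] += rate  # KeyError here means a path uses an unknown link
    return {link: used for link, used in load.items() if used > link_capacity[link]}


# Hypothetical three-FPGA chain: A -> B -> C.
links = {"A-B": 2500, "B-C": 2500}
streams = [
    ("adc0", 1200, ["A-B"]),
    ("adc1", 1200, ["A-B", "B-C"]),
    ("fft", 1600, ["B-C"]),
]
print(oversubscribed_links(links, streams))
```

An empty result means every stream has guaranteed bandwidth along its whole path; any entry identifies a link where streams could block one another.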

For properly designed real-time networks, the protocol can remain relatively simple and low level, ideal for FPGAs because fewer resources are used. Since data paths between nodes are known, features such as out-of-order data packet handling (as used by TCP/IP) are unnecessary, as is flow control (other than moving data between clock domains using simple FIFOs). To a certain extent, even error correction (error detection is still important) can be simplified for real-time systems; otherwise, the question of what to do with the error is raised.

By definition, if the data has errors in the first place, then resending it is no guarantee that it will be correct the next time. If the data connection failed (errors occurred or no data at all was received), why shouldn't it fail repeatedly? This indeterminate situation is not good for a real-time system. The solution is usually a system design parameter of being able to live with the errors. For an imaging application, the processors may make the decision to throw away the data it uses, carry on and resynchronize. For FPGA-based designs, this simplification of the network protocol saves large amounts of FPGA resources.
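The "detect, discard and carry on" policy can be sketched as follows. The choice of CRC-32 and the frame layout (payload followed by a 4-byte checksum) are assumptions for the example, not something the protocols discussed here prescribe.

```python
import zlib


def make_frame(payload: bytes) -> bytes:
    """Append a CRC-32 of the payload (big-endian) so the receiver can detect errors."""
    return payload + zlib.crc32(payload).to_bytes(4, "big")


def receive(frames):
    """Keep payloads whose CRC matches; drop the rest without requesting a resend."""
    good, dropped = [], 0
    for frame in frames:
        payload, crc = frame[:-4], int.from_bytes(frame[-4:], "big")
        if zlib.crc32(payload) == crc:
            good.append(payload)
        else:
            dropped += 1  # live with the error: discard and continue
    return good, dropped


line0 = make_frame(b"image line 0")
line1 = bytearray(make_frame(b"image line 1"))
line1[3] ^= 0x01  # simulate a single bit error on the link
good, dropped = receive([line0, bytes(line1)])
print(good, dropped)
```

Because there is no retransmission path, the receive time per frame stays bounded, which is the property the real-time system needs.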





Efficient FPGA-Based Fabric
Communications

Clusters of FPGAs that link streams of data between devices in real time need efficient point-to-point data links and protocols. Using protocols such as PCI Express consumes large amounts of FPGA resources. A PCI Express x4 core could account for as much as 30-40% of a Xilinx XC2VP50's resources. By contrast, a much simpler protocol, such as Serial FPDP, uses as little as 3-5% of the resources (for a x1 channel) of the same size FPGA. But Serial FPDP was developed as a simple sensor interface rather than a network protocol (Table 1). However, the Aurora protocol, developed by Xilinx, is optimized for FPGA-to-FPGA communications. By comparison, Aurora uses around 4% of an XC2VP50 FPGA; this is for a x4 channel (less for a x1 channel) and includes lane alignment, error detection and flow control, both user-defined and native.
Serial protocols using multi-Gbit/s data links are an efficient use of I/O pins for an FPGA. However, even the largest FPGA devices have no more than 20-80 full duplex serial links, or if used as x4 links for higher bandwidth, there are only four to six channels. If that is the case, what about large clusters of FPGAs with multiple data types and high-connectivity requirements? How are the limitations in the number of connections resolved? For these scenarios, the data links have to be shared with the different data flows between or through devices using a real-time network. In effect, the network must support bridging through an FPGA so that logical data paths can be established anywhere around the system.

The problem with the simple point-to-point protocols developed for FPGAs is that they do not handle bridging via an intermediary FPGA. Higher-level protocols are required that understand what data is targeted at them and, if it is not, where it should go. If the data flow is self-identifying, then it should be straightforward for the real-time network to handle this. However, it is not something that the developer wants to deal with; there is an increased expectation that this is something the system vendor should be providing.
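Bridging through an intermediary device amounts to a per-node decision: consume the data if its tag is local, otherwise pass it to a fixed next hop. The model below is a hypothetical sketch (the `Node` class, tag numbers and hop limit are all invented for illustration) of how self-identifying data lets that decision stay simple.

```python
class Node:
    """One FPGA in the network: consumes local tags and bridges all others."""

    def __init__(self, name, local_tags, routes):
        self.name = name
        self.local_tags = set(local_tags)
        self.routes = dict(routes)  # tag -> name of the fixed next-hop node
        self.delivered = []

    def handle(self, tag, payload, network, hops=0, max_hops=8):
        """Return (destination_name, hop_count), or None if the data is undeliverable."""
        if tag in self.local_tags:
            self.delivered.append(payload)
            return self.name, hops
        next_hop = self.routes.get(tag)
        if next_hop is None or hops >= max_hops:
            return None
        return network[next_hop].handle(tag, payload, network, hops + 1, max_hops)


# Hypothetical chain in which FPGA B only bridges tag 7 on its way to FPGA C.
network = {
    "A": Node("A", local_tags=[], routes={7: "B"}),
    "B": Node("B", local_tags=[5], routes={7: "C"}),
    "C": Node("C", local_tags=[7], routes={}),
}
print(network["A"].handle(7, b"samples", network))
```

Because the routes are fixed tables rather than discovered at run time, the path and hop count of every stream are known in advance, which fits the determinism requirement discussed earlier.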


A Developer's Perspective 
Handling FPGA communications in the development of an FPGA-centric design can be complex. While the performance advantages of FPGAs are well understood, implementing the network communications efficiently is critical for a successful real-time system. Ideally, a developer would like to have a toolkit of firmware IP, as they have [...] blocks. These components must be provided in such a way that only the components that are needed are included at compile time for optimal solutions, rather than generic code blocks that cater to all situations. This saves valuable FPGA resources by removing unnecessary IP cores for data channels, DMA controllers, etc.

Figure 4: Example of FPGA-to-FPGA components as provided by VMetro's TransComm.

Table 1: FPGA core usage for examples of communications protocols.

Protocol            FPGA usage
PCI Express         20-40%
Serial FPDP (x1)    *
Aurora              *
The ideal components to do this include communication link controllers, which maintain the physical interfaces such as a high-speed serial communications link or parallel LVDS ports with the ability to support virtual data streams, non-blocking switches and simple interfaces for IP to link into the fabric, etc. Such toolkits are now becoming available from companies such as VMetro with its TransComm firmware and software tools (Figure 4). With such toolkits, creating a real-time communications fabric for FPGAs, perhaps including analog input designs, becomes much easier with reduced risk.

Developers want to harness the power of FPGAs and focus on their own expertise, the application IP, not the network communications. Support for real-time FPGA networks is emerging from board level vendors. As larger FPGAs and solutions that are more sophisticated are required, this will become increasingly important.


VMetro
Houston, TX.
(281) 884-0728.
[www.vmetro.com].
Design Methodologies
Help Leverage IP for
FPGA-Based Development


For FPGA-based designs today, the spectrum of IP available on the market, common implementations in industry and fine tuning with technologies under the hood make leveraging IP a strong consideration.


by Jeff Harriman, Xilinx
Jeff Meisel, National Instruments


Code reuse in FPGA designs offers advantages such as a reduction in overall software development costs and a shorter time-to-market, both of which are well documented. A feature sometimes overlooked, however, is the tremendous design flexibility gained when using intellectual property (or IP blocks) through recent technological advances in compiler optimization, place and routing, and verification tools.

As FPGAs have become faster and more powerful, the amount of work required of them has also grown. Years ago, an FPGA was expected to provide some essential glue logic to tie a board together. But today, the trend is to pull tasks historically handled by dedicated ASICs into the heart of the FPGA design. The increase in work handled by FPGAs can be attributed to the giant leaps in technology that have pushed speed and size to new levels. However, the challenge is effectively using an FPGA's fabric resources, which requires high-level understanding of the architecture and routing process. Luckily, FPGA vendors and third-party experts offer building blocks in the form of IP blocks (also called IP cores) to simplify the design process.

IP cores cover an unlimited spectrum ranging from basic functions to extremely complex design blocks. Vendors and third parties offer cores including networking interfaces, system I/O interfaces, communications blocks, digital signal processing (DSP) functions and external memory interface controllers, as well as a suite of IP for embedded systems. These general categories feature several predefined functions such as those listed in Table 1. Cores, which can typically be parameterized to meet the specific needs of a design, are optimized to take advantage of the features of the FPGAs for which they were designed. Most cores from FPGA vendors come with documentation similar to stand-alone ASIC data sheets. You can see the growth in the popularity of code reuse by the growing online community of developers sharing open-source IP through Web sites such as OpenCores.org.

System requirements can change over time. You should expect and plan for this at the outset of your design. Figure 1 shows how a hardware developer of a DSP board could implement an FPGA-based design built almost entirely on IP blocks. The block highlighted in red, which represents the "in-house IP" that the company creates to differentiate itself from the competition, is implemented as a processor core or as state machine logic. The company also must implement a communication protocol among blocks so data can be passed between the different components.

For instance, you can recycle a DSP application involving a specific chain of filters with different interfaces. Once you have designed the signal processing chain, you can alter the means by which data is presented to the filter relatively quickly, from an Ethernet interface to a PCI Express interface, without having to redesign the signal processing chain on the basis of the data transfer interface.

In the current technology environment, existing designs can quickly become outdated. With IP, you can save development time and avoid chasing after technology trends. For example, National Instruments uses the PCI Express LogiCORE from Xilinx to migrate its PC-based data acquisition systems to the newer PCI Express standard.

The design flexibility that IP brings to the table also provides you with the ability to use higher-level tools for rapid system development by simply dropping in an IP block. The expected trade-off in moving to a higher level of abstraction in programming languages is that an easier development experience comes at the cost of code optimization. However, because IP is often pre-compiled and optimized for a particular [...] commercial off-the-shelf (COTS) hardware. The parallel nature of graphical programming maps intuitively to an FPGA.

Table 1: Example IP blocks available from FPGA vendors and third parties.


Under the Hood

Developing intellectual property source code in an organized, modular and hierarchical fashion results in faster development, better performance and better code reuse in the long run. The individual function blocks, which serve a well-defined purpose, have clearly defined inputs, outputs and parameters. Because the block is well defined, you can more easily avoid negative feature creep, that is, adding unrelated features that are specific to a single use case. Instead, you can add features over time that make the function block more general-purpose to suit a wider array of applications. This is "positive" feature creep. Additionally, you can test, validate and optimize function blocks on an individual basis. Knowing these key IP development principles can help you develop your own in-house IP or use commercially available blocks.

No matter the tool flow, the same design principles apply when optimizing your IP for a particular device. The key here is a solid understanding of the tools at your disposal, including the most effective ways to use them. This translates to deep architecture knowledge of your target FPGA. The standard features of programmable logic devices now include modular building blocks that improve size and performance for the majority of designs. Various device families include dedicated 18x18 multiplier blocks, 18 Kbit block memory, digital clock managers, FIFOs and a DSP48 slice to complement the rest of the configurable FPGA fabric. Understanding the capabilities of these blocks and the routing resources available helps you maximize your design performance.

While you can often infer these blocks using synthesis tools with generic code, you need to ensure their optimal use. A good example of this is a custom multiply-accumulate (MAC) FIR filter. A MAC FIR is composed of storage elements, control logic, a multiplier and an adder. The specs for such a design should, at a minimum, include the data sample rate and the desired frequency response. At this point, take stock of the requested specifications and observe how they map to your hardware with regard to your target architecture. What is the ratio between the sample rate and the system clock? How many taps do you need and what bit width should you use, taking quantization into account?
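Two of these questions reduce to simple arithmetic, sketched below. The helper names and the example figures (64 taps, 10 MHz sample rate, 16-bit data, 18-bit coefficients) are assumptions for illustration; the accumulator width uses the usual worst-case bound of data bits plus coefficient bits plus ceil(log2(taps)).

```python
def mac_units_needed(num_taps: int, sample_rate_hz: int, clock_hz: int) -> int:
    """MAC engines needed when each engine is time-shared across clock cycles."""
    cycles_per_sample = clock_hz // sample_rate_hz
    if cycles_per_sample < 1:
        raise ValueError("clock must be at least as fast as the sample rate")
    return -(-num_taps // cycles_per_sample)  # ceiling division


def accumulator_bits(data_bits: int, coeff_bits: int, num_taps: int) -> int:
    """Worst-case accumulator width: product width plus growth from summing the taps."""
    growth = (num_taps - 1).bit_length()  # equals ceil(log2(num_taps)) for num_taps >= 1
    return data_bits + coeff_bits + growth


# Hypothetical filter: 64 taps, 10 MHz samples, 500 MHz system clock.
print(mac_units_needed(64, 10_000_000, 500_000_000))
print(accumulator_bits(16, 18, 64))
```

In this example the clock offers 50 cycles per sample, so two time-shared MAC engines cover 64 taps, and a 40-bit accumulator cannot overflow.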

When you answer these questions, do not just consider the algorithm. Also examine the building blocks already available. In a Virtex device, the DSP48 slice is ideal for performing MACs with a system clock running up to 500 MHz. Once you determine how many coefficients you need, compare that to the different memory resources available. The four-input look-up tables map well to memories with depth increments of 16. However, as distributed memories become large, the speed they can run at decreases and memory can become a bottleneck. At what point should you switch over to a dedicated block RAM? The answer to that question depends on your performance and resource requirement balance. In a predefined IP block, you never have 100 percent flexibility to control all of these details.
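The memory trade-off can be explored with a rough estimator like the one below. It is a simplified sketch: the depth increment of 16 and the 18 Kbit block size come from the text, while the 64-entry switch-over threshold, the one-LUT-per-bit-per-16-entries cost model (which ignores output multiplexers) and the function name are assumptions.

```python
BLOCK_RAM_BITS = 18 * 1024  # one 18 Kbit block memory


def coefficient_storage(num_coeffs: int, coeff_bits: int, max_lut_depth: int = 64):
    """Return (kind, depth, resource_count) for storing FIR coefficients.

    kind is "distributed" (count = LUTs) or "block" (count = 18 Kbit blocks).
    max_lut_depth is an assumed threshold, to be tuned per design.
    """
    depth = -(-num_coeffs // 16) * 16  # round up to the LUT depth increment
    if depth <= max_lut_depth:
        luts = (depth // 16) * coeff_bits  # one 16x1 LUT RAM per bit per 16 entries
        return ("distributed", depth, luts)
    blocks = -(-(num_coeffs * coeff_bits) // BLOCK_RAM_BITS)  # ignores aspect-ratio limits
    return ("block", num_coeffs, blocks)


print(coefficient_storage(32, 18))
print(coefficient_storage(200, 18))
```

Lowering `max_lut_depth` favors speed at the cost of block RAMs, while raising it saves block RAMs but risks the distributed-memory slowdown described above.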


Tweaking the Tools 

Once you have written the code and verified it behaviorally, you still have several steps left that influence the final hardware results. If you have been careful to map your design well to the device primitives, you should not need to iterate through the source code. To maximize the speed performance and minimize the area at this stage, you must know which buttons to push in synthesis and be familiar with the back-end tools. In large designs, hierarchy is important for organization. You can keep the boundaries between modules intact or you can allow the tools to blur them.

Depending on the blocks that make up your design, maintaining the hierarchy for some parts of the design but not others may be wise. When a core has area or placement constraints [...] the back-end tools find the optimal placement for registers within a combinatorial path [...] an additional speed boost.

Figure 1: System-level diagram of an IP-centric FPGA design.

Example of using an IP block in a high-level programming environment (LabVIEW FPGA).

Understanding the tools available to you as well as your hardware capabilities helps you produce efficient and robust designs. While creating custom functions and interfaces is part of any design, keeping reuse in mind helps you protect the investment in work you have already done. High-level tools can assist you in implementing this concept when using hierarchical and modular design flows. Finally, with IP cores, you can quickly create complex designs when your resources and time are limited.
Embedded FPGA Soft Core
Processor Enables Universal 
CompactPCI Applications 


by Pat Mead, Altera 
Barbara Schmitz, MEN Mikro Elektronik 





Programmable logic has reached such a state of advancement in terms of speed and density that it has become a truly attractive alternative to RISC and CISC processors. It can form a "matrix" within which processing, peripherals, data paths and algorithms can be placed to create powerful, flexible and upgradable systems. Programmable logic is now available in forms and sizes that range from the traditional use as glue logic up to structured ASIC replacements and even further. To date, to fully use the advances of this key technology (re-programmability, reusability and upgradability) you need to be an FPGA expert, but these benefits need to be opened to a much wider market.

A Nios-II-CompactPCI development package designed as an open FPGA platform includes a sample design with a PCI system unit integrating the standardized Wishbone bus and the Altera Avalon switch fabric. The PCI system unit forms the interface to the PCI bus, where the CPU board can then be addressed as a PCI slave, and connects to the Wishbone bus, where an SDRAM and flash controller are already implemented.

The 3U CompactPCI card with a Cyclone FPGA and the integrated Nios II microcontroller soft core is designed for final use in volume production and it acts at the same time as the standard FPGA development platform for this application (Figure 1). As a universal FPGA platform, the board has a multitude of directly accessible I/O pins. The Nios II CPU in the Cyclone FPGA provides performance similar to an ARM processor. It allows the use of the CPU board, for example, as an intelligent slave on the CompactPCI bus. The FPGA and the integrated processor core support a 32-bit bus with 33 MHz, control 32 Mbyte SDRAM and support read and write to and from the 2 Mbyte flash memory. The special flash structure provides initial programming using a boundary scan interface. Once configured, the FPGA may be reconfigured at any time during operation with data from the CompactPCI bus.

The FPGA also controls four status LEDs and up to 83 user-defined I/O pins. The final functionality of the board depends entirely on the application and can be anything from a simple UART solution up to a complex analog front end with DSP-like data pre-processing. In any case, the CompactPCI card supports a nearly endless range of applications. The designer can use IP cores to configure the function. This includes different serial interfaces from RS-232 to intelligent HDLC protocols up to Fast Ethernet. Other functions include graphics, fieldbus connections or digital I/O. On the other hand, the included development package allows the user to create custom cores or to integrate third-party cores from opencores.org or from Altera.


Open-Platform Development Concept 

The user can now add any kind and number of IP cores to the Wishbone bus. To do this, a "Wishbone Bus Maker Tool" has been developed, which can be used to generate the Wishbone bus and which is part of the development package. The Wishbone Bus Maker can generate multi-master and multi-slave bus systems. A Wishbone-to-Avalon bridge and, vice versa, an Avalon-to-Wishbone bridge (Figure 2) allow the additional integration of Avalon-based IP cores and especially of the Nios II soft processor core. Nios II connects to the Avalon switch fabric, where a GPIO module for the user LED control is already implemented as well. The user can now also add any kind and number of IP cores to the Avalon switch fabric by using the SOPC Builder tool from Altera, which is part of the Quartus II development package.


































The open-platform concept for Nios-based development and integration of IP cores for Wishbone and the Avalon switch fabric can also be expanded on the hardware side. Currently a new generation of ANSI standard PMC and M-Modules is being designed that uses the Cyclone II family. The entire board logic including the individually configurable FPGA is located on the base PMC or M-Module, while the physical interfaces are implemented on an adapter card, which can be plugged either on the PMC or the M-Module. Since both PMC and M-Modules can be used on all kinds of system platforms using dedicated carrier boards, the user can now concentrate completely on the FPGA application.


Flexibility—Base for 
Cor Applications 

An example application is now in use in the automated driverless underground. A leading manufacturer is using standard 19" systems. These redundantly built 3U CompactPCI systems feature a Pentium III CPU, analog and digital I/O, sensors for position encoders and an optional MVB link. A feedback channel allows sending back data from the vehicle permanently to the central control station.

FPGAs are used in three different boards. The FPGA of the CPU board contains a watchdog and different UARTs; the NAND flash is controlled by the Nios II microcontroller. A second board features eight UARTs completely integrated in the FPGA, used for asynchronous RS-422 operation and optional synchronous HDLC (without Nios). The third board features digital I/Os, analog outputs, counter pulses, radar sensor and interrupt inputs, all implemented in the FPGA hardware without using the Nios II. This standard CompactPCI system can be configured for the requirements of the operation of the underground trains in different cities and countries by only changing the content of the FPGA; the hardware remains the same.

Modern low-cost FPGA components have a usable size. The Cyclone II family from Altera contains nearly 70,000 logic elements and their pricing is acceptable, starting from just a few dollars for the smaller devices. This makes them effective factors for cost savings and time-to-market when making individual configurations of standard products. A time-consuming and expensive redesign of a board can often be avoided through application-specific integration of IP cores in the FPGA. Furthermore, FPGA technology is indispensable wherever long-term availability or harsh industrial environments are involved. IP cores per se are not threatened by discontinuation, even if an FPGA component may be replaced by a newer one after 10 years, for instance.

The Nios II family of 32-bit RISC embedded processors delivers more than 100 DMIPS of performance when implemented in the Cyclone II family. Because the processors are soft core and flexible, it is possible to choose from a nearly unlimited combination of system configurations to meet the required performance, features and cost. The Nios II processor family consists of three cores, fast (Nios II/f), standard (Nios II/s) and economy (Nios II/e), each optimized for a specific price and performance range. All three cores share a common 32-bit instruction set architecture and are 100 percent binary code compatible. A library of commonly used peripherals and interfaces is included in the Nios II development kit. A complete list of SOPC Builder-ready intellectual property (IP) and peripherals can be found at the Altera Web page. Using the interface-to-user-logic wizard in the SOPC Builder software enables the creation of custom peripherals and their integration into Nios II processor systems.

Figure 1: The F206N from MEN Mikro is an example of a 3U CompactPCI board based on Altera Cyclone II. The functionality of the board is entirely dependent on the IP programmed into the FPGA.

Figure 2: Bridges implemented inside the FPGA allow integration of standard IP cores along with Altera IP, including the Nios II soft core processor connected to Altera's Avalon switch fabric.


Optimized Use of Bus Resources 

The Avalon switch fabric enables multiple simultaneous data transactions for maximum system throughput. SOPC Builder automatically generates an Avalon switch fabric optimized to the specific interconnect requirements of the final system processors and peripherals. In traditional bus architectures, a single arbiter controls the communication between the bus masters and slaves. Each bus master requests control of the bus, and the arbiter then grants bus access to a single master. If multiple masters attempt to access the bus at once, the arbiter allocates bus resources to a master based on a fixed set of arbitration rules. This can lead to a bandwidth bottleneck, as only one master can access the system bus and its resources at a time.

The Avalon switch fabric's simultaneous multimaster architecture increases the system's bandwidth by eliminating this bottleneck (Figure 3). Using the Avalon switch fabric, each bus master gets its own dedicated interconnect, meaning that bus masters only contend for shared slaves, not for the bus itself. Each time a component is added or the peripheral access priorities change, SOPC Builder generates a newly optimized Avalon switch fabric with a minimum of FPGA resource use. The Avalon switch fabric supports a wide range of system architectures, including single- and multiple-master systems, and allows seamless data transfers between peripherals with performance-optimized datapaths. Off-chip processors and peripherals are equally well supported.

Figure 3: Avalon switch fabric architecture compared to traditional bus architecture.
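
The effect of moving arbitration from the bus to the individual slaves can be illustrated with a small cycle-count model. This is only a sketch and not Altera code: the function names, the one-transfer-per-cycle timing and the fixed-priority tie-break between masters are all assumptions made for illustration.

```python
from collections import deque


def cycles_shared_bus(requests):
    """Traditional bus: one arbiter, so only one transfer per cycle in total."""
    return sum(len(slaves) for slaves in requests.values())


def cycles_switch_fabric(requests):
    """Slave-side arbitration: a master stalls only when a higher-priority
    master targets the same slave in the same cycle."""
    queues = {master: deque(slaves) for master, slaves in requests.items()}
    cycles = 0
    while any(queues.values()):
        busy_slaves = set()
        for master in sorted(queues):  # hypothetical fixed priority order
            queue = queues[master]
            if queue and queue[0] not in busy_slaves:
                busy_slaves.add(queue.popleft())
        cycles += 1
    return cycles


if __name__ == "__main__":
    # Hypothetical traffic: each master lists the slaves it accesses, in order.
    traffic = {
        "cpu": ["sdram", "sdram", "uart"],
        "dma": ["flash", "flash", "sdram"],
    }
    print("shared bus cycles:   ", cycles_shared_bus(traffic))
    print("switch fabric cycles:", cycles_switch_fabric(traffic))
```

With the sample traffic, the two masters never target the same slave in the same cycle, so the fabric model finishes in half the cycles of the shared bus. When both masters address the same slave, the model falls back to serialized access, which matches the statement that masters contend only for shared slaves.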

Custom instructions allow developers using Nios II processors to increase system performance by extending the CPU instruction set to accelerate time-critical software. Using custom instructions enables the optimization of system performance in a way not possible with traditional off-the-shelf processors. The Nios II family of processors supports up to 256 custom instructions to accelerate logic or mathematically complex algorithms normally handled in software. For example, a block of logic that performs a cyclic redundancy code calculation on a 64 Kbyte buffer operates 27 times faster as a custom instruction than when performed by software.
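
To show what kind of software loop such a custom instruction replaces, here is a minimal bit-serial CRC reference model. The article does not say which CRC polynomial was used in the benchmark; the common reflected CRC-32 (polynomial 0xEDB88320) is assumed here, and the function name is hypothetical. The inner loop of eight shift-and-XOR steps per byte is the part that maps naturally onto a small block of logic.

```python
import zlib


def crc32_bitwise(data: bytes) -> int:
    """Bit-serial CRC-32 (reflected, polynomial 0xEDB88320), assumed variant."""
    crc = 0xFFFFFFFF
    for byte in data:
        crc ^= byte
        for _ in range(8):  # one shift/XOR step per bit: the hot loop
            if crc & 1:
                crc = (crc >> 1) ^ 0xEDB88320
            else:
                crc >>= 1
    return crc ^ 0xFFFFFFFF


if __name__ == "__main__":
    buffer = bytes(range(256)) * 256  # 64 Kbyte test buffer
    # Cross-check the software model against the library implementation.
    print(crc32_bitwise(buffer) == zlib.crc32(buffer))
```

A model like this is also useful as the golden reference when verifying the hardware version of the same calculation.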

Nios II processors support fixed and variable cycle operations, include a wizard for importing user logic as a custom instruction, and automatically create software macros for use in developers' code. Large blocks of data can be processed concurrently with CPU operation by adding application-specific hardware accelerators that act as custom co-processors within the FPGA. Using the cyclic redundancy code example, processing a 64 Kbyte buffer runs 530 times faster with hardware accelerators than software. SOPC Builder includes a wizard that allows developers to add their acceleration logic and DMA channel to the system.

A complete set of tools is available for the hardware design, including the SOPC Builder system development tool, Quartus II design software, ModelSim-Altera software and SignalTap II embedded logic analyzer. Hardware design for creating Nios II processor-based systems uses the SOPC Builder system development tool to specify, configure and generate systems. Launching from within the Quartus II design software, SOPC Builder provides an intuitive wizard-driven graphical user interface for creating, configuring and generating system-on-a-programmable-chip (SOPC) designs.





To make the software design flow as easy as possible, all software development tasks can be accomplished within the Nios II IDE, including building, debugging programs and flash programming. As part of the Nios II IDE, Altera partners with operating system and middleware providers for additional software development tools. A PC, an Altera FPGA device and a JTAG download cable are everything you need to develop and debug Nios II processor-based systems. The Nios II architecture supports a JTAG debug module that provides on-chip emulation features to control the processor remotely from a host PC. The IDE can communicate with the JTAG module on one or more processors. This allows downloading programs, starting and stopping program execution, setting breakpoints and watchpoints, analyzing registers and memory and collecting [...]








The instruction set simulator (ISS) 


pro 
hardware platform 





makes it possible ta bet 
s before 








is ready. ms also incorporate 
flash memory on the board 
(CFL-compliant fash dé 


the FPGA. can be po 


Therefore any 









IDE flash programmer. The flash programmer is pre-configured to work with all the boards available with the Nios II development kits, and can be easily ported to any custom hardware. In addition to a project setup wizard, the IDE provides software code examples, in the form of project templates, to help bring up working systems as quickly as possible.
The IDE enables quick system customization using system software. The hardware abstraction layer (HAL) library is a lightweight runtime environment that provides a simple device driver interface for programs to communicate with underlying hardware. MicroC/OS-II from Micrium is a complete, portable, ROMable, pre-emptive real-time kernel, shipped with all development kits, and includes full source code, ref[...] a free developer's license. Included in the development kit is also an open-source lwIP TCP/IP stack that is built to work with the MicroC/OS-II applications and implements the standard UNIX socket API, as well as a full-featured Linux operating system.
FPGA-Based Development
for Defense and
Aerospace Applications


The right FPGA design toolkits not only speed development, but can simplify 
the addition of custom IP so designs can be tailored to specific applications. 


by Steve Edwards 
Curtiss-Wright Controls Embedded Computing 


The parallelism, speed and I/O flexibility provided by today's FPGAs make it possible for system engineers to replace multiple processor boards with a single FPGA COTS board. For the defense and aerospace market, where high performance frequently must be traded off with power and size/weight restrictions, these FPGA boards offer the best of both worlds: high performance in a single slot.

Formerly, FPGA solutions had the reputation of being costly, due to long development cycles and high development costs compared to traditional software-based solutions. But today's FPGA device families, combined with appropriate FPGA design kits, help engineers get designs to market rapidly. In addition, they offer lower costs and greater flexibility that simplifies adding the system developer's intellectual property (IP).

One of the prime advantages of today's FPGAs is the balance they provide between processing and I/O. This balanced approach makes FPGAs very efficient at simultaneously processing several high-speed, parallel data streams. Such I/O versatility means that multiple banks and types of high-speed memory, including DDR SDRAM and DDR SRAM, can be connected to an FPGA.

Many newer families of FPGAs, such as Xilinx's Virtex-II Pro and Virtex-4, feature high-speed serial transceivers, each capable of throughput of 3.125 Gbits/s or greater. These transceivers support many of the high-speed serial interfaces used in emerging defense/aerospace applications, such as Serial RapidIO (SRIO), PCI Express (PCIe), 10 Gigabit Attachment Unit Interface (XAUI) and Gigabit Ethernet.







The combination of parallel processing with fast, synchronous memories and high-speed serial transceivers lets system designers replace multiple processing cards with a single card containing two or more FPGAs. This results in systems with lower power requirements, weight and cost.


























FPGA-Based COTS Boards 
FPGA-based COTS boards targeted to the defense/aerospace market often share certain common elements. For example, they all provide one or more high-density, high-performance FPGAs, high-speed serial I/O and high-performance memory.

But hardware architecture is only part of the solution. To realize the full value that FPGAs can deliver, the success of an FPGA-based project is dependent on having the right FPGA design tools. Some of these can shorten time-to-market and lower development cost, while also providing an open architecture that makes it possible to tailor the design to specific applications.

Systems engineers evaluating FPGA boards must address certain challenges. These include how to add their algorithms to the FPGA and how to simulate the design at the system level to ensure that these algorithms work. Because FPGA development can be costly, engineers also must evaluate whether their approach will save time and money and if the resulting solution will be robust and reliable.

To understand the importance of design tools, it is useful to consider how data is handled by an FPGA board. For example, one board (Figure 1) uses two Virtex-II Pro VP-70 or VP-100 FPGAs, several high-speed serial interfaces and a large amount of high-speed memory to achieve a high-performance, reconfigurable computing engine. The combination of DDR SDRAM for bulk storage and DDR SRAM for fast, non-sequential storage of algorithm data allows flexibility in mapping algorithms to this board. In a typical application, data comes in via a high-speed serial interface and is stored in SDRAM. This data is then pulled out of SDRAM by the end application and processed via the customer's algorithm. Intermediate results are stored in internal or external SRAM and the final result is decimated to a lower data rate.

Figure 1: Curtiss-Wright's CHAMP-FX architecture balances high-speed processing, memory and I/O.

The FPGA Developers Kit 

For DDR SDRAMs, FPGA designers must confront the challenges of aligning data and data strobes, tight timing constraints, signal integrity issues and simultaneously switching output (SSO) noise. In addition, certain design issues can prolong design cycles or force designers to accept reduced performance.

To make matters worse, all of these hurdles become more pronounced at high frequencies. On a read of data from SDRAM, the data is valid for only two to three nanoseconds. Significant effort is required to latch it reliably inside the FPGA, and then synchronize it with the rest of the logic there. The FPGA designer is thus faced with significant challenges that may take several man-months to complete, but that can be solved by using the IP in some FPGA developers kits.

The IP provided is fixed to certain regions within the FPGA (Figure 2). This is done to ensure that all critical paths meet timing, as well as to confine the overall IP design to a small region of the chip to minimize logic resources.


FPGA IP Designed Specifically for the Hardware 

Designers must also make sure that the IP works with the hardware. One potential challenge is the issue of SSO noise. In a Xilinx FPGA, when too many outputs in a particular bank switch at the same time, simultaneously switching output introduces noise into the system and causes one or more bits in a particular bank to flip to the wrong value.

This can be a tricky problem to track down if the designer is not familiar with the SSO phenomenon and with the particular board hardware involved. The FPGA board supplier should test the board with SSO in mind, and its memory interface pinouts should be selected to avoid this issue.

A DDR SDRAM read cycle provides a good example of how the FPGA IP and the board hardware must work together. In order to read data, the FPGA sends out a clock signal to the SDRAM and waits two clock cycles for a return four-word burst. The challenge is how to clock the data back into the device. Be[...] transmitted clock signal and the incoming data. This is not an optimal design approach because skew minimizes the window during which valid data can be clocked into the FPGA.

Figure 2: To minimize routing delays and improve timing, I/O IP blocks are located along the edge of the FPGA near I/O pins.

A better approach is to compensate for trace delays in the FPGA by phase shifting the internal clock relative to the incoming data. This allows the FPGA to clock in read data with the appropriate timing margins. However, to do this the FPGA IP designer must be intimately familiar with the PCB characteristics of the FPGA-SDRAM interface.

Figure 3: An internal crossbar switch is used to connect user IP to IP supplied in the CHAMPtools-FX FPGA Developers Kit.
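
The arithmetic behind such a phase shift can be sketched in a few lines. The model below is a simplification and not taken from any vendor kit: it assumes the read data becomes valid a fixed round-trip delay after the internal clock edge and stays valid for a fixed window, and it places the capture edge at the center of that window. The function names and all numeric values in the demo are hypothetical.

```python
def capture_phase_deg(period_ns, round_trip_ns, valid_window_ns):
    """Phase shift (degrees) that puts the capture edge mid-window."""
    center_ns = round_trip_ns + valid_window_ns / 2.0
    return (center_ns % period_ns) / period_ns * 360.0


def margins_ns(period_ns, round_trip_ns, valid_window_ns, phase_deg):
    """Return (setup, hold) margins for a capture edge at phase_deg.

    setup: how long the data has already been valid at the capture edge.
    hold:  how long it stays valid afterwards (negative means a miss).
    """
    edge_ns = phase_deg / 360.0 * period_ns
    offset_ns = (edge_ns - round_trip_ns) % period_ns
    return offset_ns, valid_window_ns - offset_ns


if __name__ == "__main__":
    # Assumed numbers: 7.5 ns clock period, 3.2 ns round-trip delay
    # (traces plus SDRAM access), 2.5 ns data-valid window.
    phase = capture_phase_deg(7.5, 3.2, 2.5)
    print("phase shift (deg):", round(phase, 1))
    print("margins shifted:  ", margins_ns(7.5, 3.2, 2.5, phase))
    print("margins unshifted:", margins_ns(7.5, 3.2, 2.5, 0.0))
```

In this model the unshifted clock edge falls outside the valid window (negative hold margin), while the shifted edge splits the window evenly. The round-trip delay is exactly the board-dependent quantity that requires knowledge of the PCB.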

Some argue that, to simplify the design process and reduce design cycle time, designers should use memory-controller IP cores provided by FPGA vendors or third-party suppliers. This is a valid assertion if understood in the right context. FPGA IP that has been designed for a particular hardware platform with the end application in mind, and that has been tested and qualified for use in that application, will certainly save the designer time and money.


Interfaces and IP Integration 
The IP in the FPGA developers kit is extremely important. For example, IP in the CHAMPtools-FX FPGA Developers Kit, for the Xilinx-based board described above, is built around one of two standard Xilinx interfaces: IP Interconnect (IPIC) or Local Link. IP Interconnect is a standard memory-mapped interface that allows the IP block to send data to a particular memory address. SDRAM, SRAM and PCI IP are provided with IPIC interfaces.
Local Link is a streaming data interface in which the data's destination address is predetermined. Local Link interfaces are provided with DMA controllers so that engineers can set up a destination buffer to a memory-mapped location. As data comes into the Local Link interface, the DMA automatically sends it to the correct memory address.
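
The relationship between the streaming side and the memory-mapped side can be pictured with a small behavioral model. This is a sketch only, not the Local Link protocol or the kit's DMA controller: the class name, the word-addressed list standing in for memory and the overflow behavior are assumptions.

```python
class StreamDma:
    """Behavioral model: streamed words land in a preset destination buffer."""

    def __init__(self, memory, base, length):
        self.memory = memory   # list standing in for memory-mapped storage
        self.base = base       # destination buffer start (word address)
        self.length = length   # destination buffer size in words
        self.offset = 0        # next free position inside the buffer

    def push(self, word):
        """Accept one word from the stream; the sender supplies no address."""
        if self.offset >= self.length:
            raise BufferError("destination buffer full")
        self.memory[self.base + self.offset] = word
        self.offset += 1


if __name__ == "__main__":
    memory = [0] * 16
    dma = StreamDma(memory, base=4, length=3)
    for word in (0xA, 0xB, 0xC):
        dma.push(word)
    print(memory)
```

The point of the model is that the data source never deals with addresses: the destination is fixed once when the buffer is set up, and the DMA advances the write pointer on its own.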
Local Link interfaces usually include RocketIO IP. Both of these interfaces must be well documented in the FPGA developers kit. This documentation should describe the signals for each interface, as well as timing diagrams and anything else needed to develop custom IP using [...]

This block allows other IP blocks with IPIC or Local Link interfaces to be connected together so that data can be transferred among them (Figure 3). Multiple IPIC switches can be instantiated in the design, allowing engineers flexibility in choosing the best solution.

Finally, integration is easy when all of the IP can be integrated within the Xilinx Embedded Development Kit framework. This allows designers to easily instantiate or remove IP blocks and synthesize the design, using a GUI-based software framework.


Simulation and Hardware Test

Design verification, including functional verification, typically consumes the biggest part of the development cycle. As design complexity increases, this causes a dramatic increase in simulation time. Many FPGA designers recommend budgeting 50% of IP development to simulation. Some of this time is spent simulating at the IP block level, but a good portion is also spent in full simulation of the entire design. This is the last hurdle for the FPGA designer before testing the design on the target hardware.

After all of the bugs have been removed from individual blocks, the design needs to be put together and simulated. This must be done before testing in silicon. Simulation requires a testbench environment that includes models of all of the external interfaces. It also requires the ability to initialize memory interfaces, pass data from the PCI and RocketIO into the FPGA, and check memory, as well as the ability to test PCI or RocketIO data against expected data previously stored in files. The FPGA developers kit must provide all the features needed for a full functional simulation of the FPGA.

Today, many FPGA algorithms are developed using a tool such as Matlab, which enables designers to input real data into their algorithms and capture the results in a file. These input and output results can then be reused during simulation.

The simulation environment provided by the Xilinx-based board's developers kit lets engineers input data from a file into the simulation from the board's external PCI or RocketIO interfaces. Simulation data can be stored to a file at any of the memories or the PCI or RocketIO interfaces. These results can then be compared to result data captured during the Matlab simulation, enabling developers to verify the correctness of their VHDL IP model versus the mathematical model originally created in Matlab. The ability to check data at any of the memory interfaces is particularly useful for allowing developers to check integrity at multiple stages within an algorithm.
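
The comparison step itself is simple enough to sketch. The helper below assumes that both the simulator and the mathematical model dump one integer sample per line; that file format, the function names and the optional tolerance are assumptions for illustration, not features of any particular kit.

```python
def parse_samples(text):
    """Parse one integer sample per line, skipping blank lines."""
    return [int(line) for line in text.splitlines() if line.strip()]


def first_mismatch(simulated, expected, tolerance=0):
    """Return the index of the first differing sample, or None if all match.

    A length difference counts as a mismatch at the first missing index.
    """
    for index, (sim, exp) in enumerate(zip(simulated, expected)):
        if abs(sim - exp) > tolerance:
            return index
    if len(simulated) != len(expected):
        return min(len(simulated), len(expected))
    return None


if __name__ == "__main__":
    golden = parse_samples("10\n20\n30\n")   # stands in for the model's output file
    captured = parse_samples("10\n21\n30\n")  # stands in for the simulation dump
    print("exact compare:   ", first_mismatch(captured, golden))
    print("tolerance of one:", first_mismatch(captured, golden, tolerance=1))
```

Running the same check on data captured at each memory interface narrows a failure down to the processing stage where the outputs first diverge. A nonzero tolerance can be useful when the hardware uses fixed-point arithmetic and the reference model uses floating point.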

The final step is testing the design on the hardware. At this point the design is almost complete, but there is still the possibility that the test environment did not test for all possible conditions, or that there is some variation between simulation and the actual hardware. The Xilinx-based board's developers kit makes it possible to automatically insert Xilinx ChipScope IP onto either the IPIC or Local Link interface and specify which ports to monitor. This is an easy way to add an in-circuit logic analyzer function to the FPGA design, especially at boundary points between customer IP and IP provided by the development tools vendor.

System engineers need the right tools in their toolkit to complete the job well, on time and under budget. The ideal FPGA developers kit provides the right tools for the system engineer to design a robust, reliable product on time and under budget. It is an absolute necessity for getting the job done.














Curtiss-Wright Controls Embedded Computing
Leesburg, VA 

(703) 778-7800. 
Slice Architecture Tackles Growing Thermal Demands of High Performance

Themis Computer has announced its new Slice switched computing architecture, designed to meet the escalating demands for thermal and kinetic management imposed by next-generation high-density, high-performance mission-critical computing. A processor-independent architecture, the Themis "Slice" platform allows users to mix, match and manage SPARC and x86 architectures, and Solaris, Windows and Linux operating systems, in combination with third-party network servers, storage and switches. Quorum, Themis' real-time, policy-based resource manager, ensures contracted application quality of service (QoS) for heterogeneous computing resources. Designed for high-density, high-performance computing, the Themis Slice architecture is suited for those who are looking for highly available, horizontally scalable processing power and lower life cycle cost of ownership.

Themis Slice is offered in air and liquid cooling variants that provide thermal headroom to accommodate aggressive scaling of commercial microprocessor core density, speed and power. All Themis Slice elements are inherently rugged and have a uniform mechanical footprint and standard rack height and depth [...]. Up to [...] Slice elements, including a common Power Slice, can be combined in a 5RU docking station, or Subrack. The Subrack blind mates with connectors on the Slice elements, providing power distribution, cable management and dripless coupling for liquid cooling of the constituent Slice Subrack modules. The Subrack allows liquid and air-cooled Slices to be intermixed within a single docking station, providing blind-mated dripless couplers to cooling liquid manifolds in the docking station. Subracks can be interconnected, using external switches, to configure highly scalable grid computing systems.

Within the Subrack, Slice elements plug into an InfiniBand high-speed, low-latency serial switch fabric, so that clusters of up to five Slice elements can be interconnected without external switches. Processor Slices interface with General Purpose I/O (GPIO) Slices over a PCI Express high-speed serial fabric, also supported by the Subrack. Other cluster elements include Storage Slices and a four-port GigE Target Channel Adapter located in the Power Slice.

Up to four Processor Slices with up to 64 cores can be configured in a single 5U Subrack. The cluster fabric, internal to the Subrack, uses InfiniBand switches and links for superior (low) memory-to-memory transfer latency and scalable bandwidth. Subrack clusters can be configured using either GigE or InfiniBand. Themis Slice Subracks can be combined with IBM or Sun servers, in heterogeneous networks, using either networking technology. At the Subrack level, Themis Slice technology is a truly open architecture that offers superior SWAP and environmental resiliency. Processor Slices are priced from $1000. Processor Slice configurations bundled with the Subrack docking station and power supply are priced from $26,000, in OEM quantities.

Themis Computer, Fremont, CA. (520) 252-0870. [www.themis.com].


2.0 GHz CPU Performance Plus Data Acquisition on a Single EPIC Board

A 2.0 GHz processor teams with auto-calibrating A/D circuitry on a single EPIC form factor board to conserve space and reduce cost for applications that need to combine a high-speed CPU with data acquisition capability. These include military systems, transportation systems, [...] control, medical instrumentation and industrial control systems. The Poseidon from Diamond Systems offers the VIA Eden ULV or the VIA C7 processor along with USB 2.0, Gigabit Ethernet, SATA, flash storage and the automatically calibrating A/D interface.

The Poseidon SBC is offered with either a fanless 1.0 GHz Eden ULV or 2.0 GHz VIA C7, both with on-chip cache and 400 MHz front-side bus, along with the VIA CX700 chipset. The board comes with 256 Mbytes or 512 Mbytes of DDR2 RAM soldered on the board. The chipset integrates the VIA UniChrome Pro 2D/3D graphics controller, with MPEG-2 hardware acceleration, CRT and LVDS flat panel support and dual display capability. Included are also four USB 2.0 ports, four RS-232 serial ports (two with RS-422/485 capability) and both IDE and S-ATA interfaces. Single-unit pricing starts at $700.

Diamond Systems, Mountain View, CA. (650) 810-2500. [www.diamondsystems.com].

Small, Low-Power Controller Board Has CAN

In systems with a large amount of industrial I/O in tight, remote places, it can be difficult to add functionality or turn specific functions on or off without having to reprogram the entire system. A RoHS-compliant board from Micro/sys allows exactly that.

The MCB58, a new member of the Micro/sys SNAP series of microcontroller boards, is based on the Freescale HC(S)08 MCU. Measuring 3.55 in. x 2.65 in. x 0.5 in., the MCB58 typically draws a low 300 mW while running at its fastest speeds and executes with a 50-nanosecond instruction time. The board's wide assortment of I/O includes [...] an RS-485 serial port, four isolated digital outputs, four isolated digital inputs and 24 additional lines of TTL-level digital I/O. Filters allow four channels of PWM to act as four independent D/A converters. Also included are an onboard temperature detector, a real-time clock, CAN interface and 16k serial EEPROM.
The board contains [...] Kbytes of program/data flash and 4 Kbytes of SRAM. Metrowerks' CodeWarrior compiler is included in the SDK. Single quantity pricing is $98.

Micro/sys, Montrose, CA. (818) 244-4600. [www.embeddedsys.com].


Programmable Internet Controller for Machine- 
to-Machine Applications 
A low-cost programmable Internet Controller chip lets the developer determine its pin-out and functionality by its "firmware flavor." The iChip CO2128Q from ConnectOne ships from the factory with the basic boot block. Designers can then choose the firmware flavor that includes the features and Internet protocols (IP) needed for their application-specific designs. The CO2128Q's low power consumption and remote updates make it suitable for devices such as point-of-sale terminals, gateways, medical devices, utility meters and handheld devices that use wired or wireless LAN.

The 64-pin TQFP CO2128Q is targeted to meet machine-to-machine (M2M) application connectivity needs. Designed for 802.11b/g WiFi and 10/100BaseT LAN access, the CO2128Q's firmware flavors enable a combination of up to 10 TCP/UDP sockets, [...] listen sockets for HTTP, SMTP and FTP, and the SerialNET port server operating mode for serial-to-IP bridging. A choice of [...] or remotely updatable firmware is available, and protocol maintenance is free from ConnectOne. Firmware flavors for cellular and [...] applications will be available in Q3.

The iChip works with any host processor and any or no host RTOS. Since iChip offloads TCP/IP communications from the host application, minimum development time or IP expertise is required. Design time typically is one man-month. Pricing is $11.25 for over 10,000 units. Development board ILEVE-SH for WiFi applications sells for $480 and ILEVE-SOU for LAN sells for $275.

ConnectOne, Phoenix, AZ. (408) 986-9602. [www.connectone.com].




















Drop-in ETX Replacement Module Solves EOL 
Problems 

Several non-RoHS ETX modules used in a wide range of high-volume commercial applications recently reached end-of-life (EOL) status, which has left stranded developers of systems based on these boards. To help keep OEMs in production, Ampro Computers has introduced an ETX module that drops into existing baseboard designs.

Utilizing the new RoHS-compliant AMD Geode LX 800 single-chip integrated processor and Northbridge, the ETX 610 targets applications ranging from building automation to vending machines. The 500 MHz Geode LX 800 is integrated with 2D graphics, memory controller and a PCI bridge. To facilitate fanless system designs, the CPU has a low thermal design power (TDP) rating of 3.9W. The module also contains DDR SODIMM RAM to 1 Gbyte, as well as USB 2.0 ports, EIDE and Serial ATA (SATA) interfaces for migration, 10/100 Mbit Ethernet, ACPI power management, PCI expansion and ISA bus expansion for custom circuitry on the ETX baseboard. LVDS flat panel LCDs are supported.

ETX 610 QuickStart Kits include drivers and BSPs for Windows XP, Windows XP Embedded and Windows CE 5.0, as well as a full Linux 2.6 distribution (Fedora Core 3). Modules will begin shipping by late May. Pricing for production volumes is in the low $200s.

Ampro Computers, San Jose, CA. (408) 360-0200. [www.ampro.com].


Low-Power Rugged Pentium M SBC in Single 
CompactPCI Slot

Targeted for embedded, rugged applications with low power consumption, the Pentium M-based CRM1 from Dynatem has a Pentium M processor that utilizes a new microarchitecture to meet the current and future demands of high-performance, low-power embedded computing, making it suitable for communications and industrial automation applications. It features advanced branch prediction capability, micro-ops fusion for improved instruction execution, and a dedicated hardware stack manager that employs hardware control for improved stack management.

Onboard CompactFlash permits single-slot booting. I/O routed to the backplane includes an EIDE port, two Serial ATA ports, two Gbit Ethernet ports (PICMG 2.16-compatible), DVO/VGA, four USB 2.0 ports, two COM ports and two PMC expansion sites. The 855GME chipset offers integrated, high-performance graphics with resolutions up to 1600 x 1200. The CRM1 complies with VITA 30.1-2002, so it comes with top and bottom cooling plates that are bonded to the major components through thermal conduction and the heat conducting [...]. Dynatem offers board support packages for such popular operating systems as VxWorks, Windows NT, Windows XP, Linux, QNX and RTX. Pricing for the CRM1 starts at $300 in single quantity.

Dynatem, Mission Viejo, CA. (949) 855-2235. [www.dynatem.com].





High-Speed A/D I/O Card Offers Four 105 MHz Channels

Getting real-world signals into the signal processing environment can be a big challenge. A new high-speed analog-to-digital I/O board from BittWare provides four channels of 105 MHz A/D conversion and a reconfigurable FPGA.

The Tetra PMC+ A/D I/O card features four high-performance, [...]-bit wideband A/D converters running up to 105 MHz. These stream data directly to an Altera Cyclone II FPGA for A/D control, distribution of converted data and front-end processing. Data pre-processing functions can be configured to enable digital filtering, decimation and digital down conversion. Data can be transferred to the baseboard over the PMC interface by the Cyclone II.

In addition to complete software development tools that allow designers to easily develop application code and integrate the Tetra PMC+ with other systems, BittWare offers a Tetra developer's kit for the Cyclone II that includes source for the A/D converters and the Link ports. The Tetra PMC+ will be available in Q2 of 2006 at a list price of $3,958.

BittWare, Concord, NH. (603) 226-0404. [www.bittware.com].
3U Conduction-Cooled CompactPCI SBC for Severe Environments

A new, rugged conduction-cooled 3U cPCI Single Board Computer (SBC) provides full system health monitoring and reporting, supporting all PICMG 2.9 specifications. The ROCK (CC61x) from General Micro Systems targets extreme applications requiring high performance, low power consumption and operation in extreme temperatures. The highest performance version is driven by the 1.4 GHz Pentium M 738 processor with 2 Mbytes of L2 cache. The microcontroller and FPGA are operational below and above the rated board operating temperatures (-40° to +85°C). The ROCK is also available in a convection-cooled version (0° to +50°C).

A patent-pending design utilizing a microcontroller performs the baseboard management controller (BMC) functions per PICMG 2.9, reports the results of the built-in test (BIT) and extended built-in test (EBIT), and monitors and controls the baseboard temperature. The CC61x comes with up to 2 Gbytes of ECC memory and up to 16 Gbytes of flash memory. The ROCK provides dual Gigabit Ethernet with a TCP/IP offload engine on a PCI-X (66 MHz/64-bit) bus. This dual-channel Gigabit device provides true Gigabit wire speeds in full duplex mode and utilizes a fraction of the CPU bandwidth when fully loaded. Support is available under Windows XP/2000, Linux, QNX and VxWorks. Quantity 100 pricing for the conduction-cooled version starts at $4,370.

General Micro Systems, Rancho Cucamonga, CA. (800) 307-4863. [www.gms4sbc.com].
3U CompactPCI A/D Board Sports Eight Channels

Squeezing high-speed acoustic technology used for sonar and vibration analysis applications into a 3U CompactPCI form factor is no mean feat. But it has been done by ICS Sensor Processing in the company's new A/D converter board, to achieve substantial reductions in size and weight.

The ICS-1745 high-speed acoustic A/D converter board features eight channels of high-frequency acoustic analog input and onboard signal conditioning with programmable gain. All channels use the Analog Devices AD9260 16-bit, high-speed oversampled A/D converter with buffer memory to deliver 2.5 Msamples/s. Onboard signal conditioning eliminates the requirement for external signal conditioning logic. With maximum bandwidth of 1.25 MHz, the ICS-1745 supports four input voltage ranges (20 Vpp, 2 Vpp, 0.2 Vpp and 0.02 Vpp differential) and 8 Mbytes of memory in two banks.

The internal [...] frequency is fixed at 128 MHz as standard; other frequencies can be supplied on request. Differential analog input is provided via the front panel, while a second front panel connector makes possible multiple board synchronization for systems requiring high channel counts. Windows and Linux drivers are available. Price is $4,235 in OEM quantities.

Interactive Circuits & Systems, Part of Radstone Embedded Computing, Ottawa, Canada. (613) 749-9241. [www.ics-ltd.com].
High-Density Digital and Analog I/O Card For Industrial Apps

A high-density PC/104 I/O card includes a 16-channel, 12-bit A/D converter, an 8-channel D/A converter and 48 lines of digital I/O, and operates in a temperature range from -40° to +85°C. The PCM-MIO from WinSystems requires no trimpots for calibration of the analog circuitry to remain within spec. The ability to do without precision adjustments results in quick and easy setup of analog systems needing accurate digitally converted voltages. This also eliminates the need for a skilled technician to perform the setup.

The analog section of the board is designed with low-noise power supplies and a low-drift voltage reference as well as with special layout techniques and low-drift resistor networks to maintain long-term accuracy and stability. The PCM-MIO is compatible with industry-standard signal conditioners that will protect, filter and isolate the analog input and output signals from external transients. All 48 digital I/O lines are individually programmable for input, output or output with read-back. The lines are TTL-compatible and can source and sink 12 mA, which allows direct connection to industry-standard, optically isolated AC and DC signal conditioners. Drivers are available for Linux, Windows CE and Windows XP Embedded. Pricing starts at $395.

WinSystems, Arlington, TX. (817) 548-1368. [www.winsystems.com].








SBC for Mobile Apps Has 5 Low-Power Modes

Designers of mobile high-performance embedded systems, such as handheld and wearable computers or small unmanned vehicles, need to be able to fine-tune power consumption. The new BitsyXb SBC from Applied Data Systems features five low-power modes to help make the SBC power-stingy, as well as dynamic variable speed and voltage regulation.

The compact, 3 in. x 5 in. BitsyXb is based on Intel's 32-bit 500 MHz XScale PXA270 CPU, with a video interface up to XGA resolution. Up to 128 Mbytes of SDRAM program memory and up to 64 Mbytes of flash memory are provided. 124 Kbytes of EPROM is included as a boot device. For expansion and connectivity, the board has a PCMCIA Type II interface, three serial ports, a USB port, an Intel Quick Capture camera sensor input bus, 10 digital I/Os, an SPI port, an I2C bus and ADSmartIO with nine configurable inputs/outputs.

An onboard power supply has an input voltage of 5V or 6-16V. The board consumes less than 1W during operation and is ruggedized at -40° to +85°C. Windows CE .NET and Linux are supported. The BitsyXb SBC is priced in the $200 range.

Applied Data Systems, Columbia, MD. (301) 490-4007. [www.applieddata.net].


Core Module Integrates Wired/Wireless Networking

As wireless devices of all kinds are increasingly connected to wired networks, design engineers are challenged with finding a single, secure solution for both. A new embedded core module from Digi International integrates 10/100 Mbit Ethernet with secure 802.11 wireless LAN networking capabilities.

Powered by Digi's 155 MHz ARM9-based NetSilicon NS9360 processor, the ConnectCore Wi-9C provides up to 256 Mbytes of SDRAM/flash memory in a compact SO-DIMM form factor. For I/O connectivity the ConnectCore Wi-9C features USB, UART, I2C, SPI, PWM and GPIO interfaces. Wireless security protocols supported include WEP, WPA and WPA2/802.11i.

The ConnectCore Wi-9C is pre-certified, eliminating costly certification delays during product development. It is also RoHS compliant. Operating temperature range is -40° to +85°C. Development kits for NET+Works, Linux and Windows CE are available. The ConnectCore Wi-9C will be available in Q3 2006 starting at $149 each in quantities of 1,000.

Digi International, Minnetonka, MN. (952) 912-3444. [www.digi.com].











Graphical Design Platform Available for ADI's Blackfin Processor

Engineers who are domain experts often want to develop applications using a single graphical platform all the way from algorithm design and prototyping to deployment and test. Now they can do this for applications based on Analog Devices' Blackfin processor. National Instruments has extended the LabVIEW graphical dataflow development environment by releasing the LabVIEW Embedded Module for ADI Blackfin processors.

The software features more than 140 Blackfin-specific, hand-optimized math, analysis and signal processing functions. Integrated I/O such as audio and video D/A converters, A/D converters and codecs are provided, as well as on-chip debugging and easy graphical interconnection via Ethernet. The module includes the ADI VisualDSP++ development and debugging environment for low-level access and deployment directly to Blackfin.

Interactive debugging is supported. Engineers can debug graphically in LabVIEW or simultaneously debug both the graphical code and the generated C source code. The module ships with application examples such as audio, control, power monitoring and communications and provides easy connectivity to NI test and measurement hardware. Pricing starts at S095.

National Instruments, Austin, TX. (512) 683-0100. [www.ni.com].


Get Connected with companies and products featured in this section.
www.rtcmagazine.com/getconnected
Evaluate Your Options
Real-Time & Embedded Computing Conference
Register now at www.rtecc.com

Enter a World of Embedded Computing Solutions

Attend open-door technical sessions especially designed for those developing computer systems and time-critical applications. Get ahead with sessions on Embedded Linux, VME, PCI Express, ATCA, DSP, FPGA, Java, RTOS, Switch Fabric Interconnects, Windows, Wireless Connectivity, and much more.

Your Resource Opportunity

Exhibits are arranged in a unique setting to talk face-to-face with technical experts. Table-top exhibits make it easy to compare technologies, ask probing questions and discover insights that will make a big difference in your embedded computing world. Join us for this complimentary event.





Sponsors: ENEA, Wind River

2006 Locations
Dallas (June 6), Houston (June 8), Boulder (June 20), Salt Lake City (June 22)
Other locations: Los Angeles, Toronto, Washington DC, Calgary, Eindhoven, Shenzhen, Ottawa, Vancouver BC, Shanghai, Seattle, San Diego, Beijing, Portland


www.rtecc.com 





Pentium M PC/104-Plus Single Board Computer

A new Pentium M-based single board computer is targeted at applications requiring substantial processing power and extensive features in a compact design, such as medical, avionics, navigation/tracking, system monitoring and security/homeland defense markets. The Cheetah from VersaLogic is suited for embedded control applications requiring a very small footprint, which the 3.6" x 3.8" PC/104-Plus board provides. Standard onboard features include two COM ports, two USB 2.0 ports, Ethernet, IDE, LPT, audio and PS/2 keyboard/mouse support. The board also features integrated high-performance video output with support for both analog monitors and LVDS flat panels. The Extreme Graphics 2 video processor includes high-speed 3D rendering, full-motion video and MPEG-2 decoding.

The PC/104-Plus interface supports both ISA and PCI expansion modules. Standard pass-through connectors allow the board to be stackable with other PC/104 modules. It can also be used as a CPU module for a larger system by plugging it into a proprietary base board that includes specific user I/O circuitry. The Cheetah also includes a customizable, OEM-enhanced BIOS designed to work with embedded operating systems, including Windows CE/XP/NT, Linux, VxWorks, QNX, DOS and other real-time OSs. Pricing is about $1,500 in OEM quantities.

VersaLogic, Eugene, OR. (541) 485-8575. [www.versalogic.com].









ETX COM Express Module Sports Intel Core Duo

The new PICMG COM Express standard for computer-on-module (COM) form factors promises higher performance and I/O bandwidth in a much smaller space. A new COM from Kontron, the ETXexpress-CD, leverages the Intel Core Duo processor to provide extremely high performance for the COM Express Basic Form Factor modules.

The 95 mm x 125 mm ETXexpress-CD includes the Mobile Intel 945GM chipset and the ICH7M Southbridge to deliver up to 2x 2 GHz processor performance. It supports up to 2 Gbytes of DDR2-SDRAM. Up to five PCI Express x1 lanes and a 32-bit/33 MHz PCI bus are included, as well as a Gigabit Ethernet port, a PCI Express Graphics x16 lane, two Serial ATA and one Parallel ATA interfaces and eight USB 2.0 ports. CRT and LVDS outputs are provided to drive high-resolution monitors and displays.

Windows XP, Windows XP Embedded, Windows 2000 and Linux are supported. The ETXexpress-CD is RoHS-compliant. Prices start at SOO, depending on processor speed.

Kontron America, Poway, CA. (888) 294-4558. [www.kontron.com].













Performance Beyond Limits 
ETXexpress products are next generation embedded modules, based on the PICMG COM Express standard. ETXexpress provides the highest performance and I/O bandwidth available in COMs.

> PCI Express - the elemental data path
> Gigabit Ethernet - for high connectivity
> USB 2.0 - for fast peripherals
> Serial ATA - for fast drives
> ACPI - for optimized power management

Get ready. Get ETXexpress

Visit www.kontron.com/ETXexpress

Kontron
SHARE YOUR VISION.
WE'LL SHARE OUR INNOVATION.

Share your vision with us. We'll customize our products to your requirements or partner with you to develop custom products all the way through deployment.

Either way, you'll get leading-edge board level solutions for the most demanding embedded applications. Tell us what you need at info@vadatech.com.

- Comprehensive Hardware/Software solution for Intelligent Platform Management Interface (IPMI) version 2.0
- Symmetric Multi-Processing CPU for AMC/VME Modules
- A complete line of ATCA and AMC carriers
- High dynamic range A/D Converters











CP6012 6U CPCI processor blade
> Scalable up to 2.0 GHz Intel® Core™ Duo, FSB 667 MHz
> Up to 4 GB Dual Channel Memory DDR2 400 MHz
> 4x GbE ports (2x Front, 2x PICMG 2.16)
> PMC or XMC slot; CompactFlash and SATA 2.5" HDD

CP307 3U CPCI processor blade
> Scalable up to 2.0 GHz Intel® Core™ Duo, FSB 667 MHz
> Up to 4 GB Dual Channel Memory DDR2 667 MHz
> 2x GbE, up to 6x USB 2.0 ports, VGA/DVI interface
> 2x SATA, onboard CompactFlash

Greater Capacity. Same Footprint. So Much More Potential.
Explore the power and the potential of two cores in one processor with Kontron CompactPCI boards designed with the Intel® Core™ Duo processor. Nearly double your processing power and achieve the unprecedented ability to run different applications and OSes on each core using VT-x virtualization technology. With so much packaged into a single slot footprint, it's easy to imagine a whole new world of embedded possibilities.

Embed your next system application with Kontron using Intel® Core™ Duo processors.